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Abstract 

A non-backtracking walk on a graph is a directed path such that no edge is the inverse of its 
preceding edge. The non-backtracking matrix of a graph is indexed by its directed edges and 
can be used to count non-backtracking walks of a given length. It has been used recently in 
the context of community detection and has appeared previously in connection with the Ihara 
zeta function and in some generalizations of Ramanujan graphs. In this work, we study the 
largest eigenvalues of the non-backtracking matrix of the Erdos-Renyi random graph and of the 
Stochastic Block Model in the regime where the number of edges is proportional to the number 
of vertices. Our results confirm the ’’spectral redemption conjecture” that community detection 
can be made on the basis of the leading eigenvectors above the feasibility threshold. 

1 Introduction 

Given a finite (simple, non-oriented) graph G = {V,E), several matrices of interest can be 
associated to G, besides its adjacency matrix A = ( 1 {„ e)u,v^v- In this work we are interested 
in the so-called non-backtracking matviyi of G, denoted by B. It is indexed by the set E = {{u,v) : 
{u, t>} € E} of oriented edges in E and defined by 

= 1(62 = /i)l(ei ^ A) = 1(62 = /i)l(e /-p, 

where for any e = {u,v) G E, we set ei = m, 62 = v, e~^ = {v,u). This matrix was introduced 
by Hashimoto [ 12 ]. A non-backtracking walk is a directed path of directed edges of G such that 
no edge is the inverse of its preceding edge. It is easily seen that for any fc > 1 , Up counts the 
number of non-backtracking walks of fc -I- 1 edges on G starting with e and ending with /. 

Our focus is the spectrum of B, referred to in the sequel as the non-backtracking spectrum of 
G, when G is a sparse random graph. Specifically we shall characterize the asymptotic behavior 
of the leading eigenvalues and associated eigenvectors in the non-backtracking spectrum of sparse 
random graphs in the limit n —> 00 where n = \V\. 

The random graphs we consider are drawn according to the Stochastic Block Model, a 
generalization of Erdos-Renyi graphs due to Holland et al. [ 13 ]. In this model nodes v G 
V are partitioned into r groups. An edge between two nodes u, v is drawn with probability 
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W[a{u),u{v))/n, where (j{u) G [r] denotes the group node u belongs to. Thus when the r x r 
matrix W is fixed the expected node degrees remain of order 1 as n —>■ oo. We focus moreover 
on instances where the fraction of nodes in group i converges to a limit 7r(*) as n —>■ oo. 

An informal statement of our results for eigenvalues is as follows. Let G be drawn according 
to a Stochastic Block Model with fixed number r of node groups such that all nodes have same 
fixed expected degree a > 1. Let denote the leading eigenvalues of the expected 

adjacency matrix A := E(A), ordered so that = a > \^ 2 \ > ■ ■ ■ > |y^r|- Let rg < r be such 
that Ip-rg+il < y/a < l/ir-ol- Then the rg leading eigenvalues of B are asymptotic to p-i,... 
the remaining eigenvalues A satisfying |A| < (1 + o{l))^/a. 

Community detection 

Our primary motivation stems from the problem of community detection, namely: how to es¬ 
timate a clustering of graph nodes into groups close to the underlying blocks, based on the 
observation of such a random graph G1 Decelle et al. [8] conjectured a phase transition phe¬ 
nomenon on detectability, namely: the underlying block structure could be detected if and only 
if 1/^21 > a/o. 

In the case of two communities with roughly equal sizes (7r(l) = 7r(2) = 1/2) and symmetric 
matrix W, the negative part (impossibility of detection when \fjL 2 \ < \/ol) was proven by Mossel 
et al [25]. As for the positive part (feasibility of detection when |/i 2 | > y/A), it was conjectured 
in [8] that the so-called belief propagation algorithm would succeed. Krzakala et al. [20] then 
made their so-called “spectral redemption conjecture” according to which detection based on 
the second eigenvector of the non-backtracking matrix B would succeed. 

Recently a variant of the spectral redemption conjecture was proven by Massoulie [23]: the 
spectrum of a matrix counting self-avoiding paths in G allows to detect communities through 
thresholding of the second eigenvector. More recently and independently of [23], an alternative 
proof of the positive part of the conjecture in [8] was given by Mossel et al. [24], based on an 
elaborate construction involving countings of non-backtracking paths in G. 

The two approaches of [23] and [24] to proving the positive part of the conjecture in [8], 
while both relying ultimately on properties of specific path counts, differ however in the following 
respects. The method in [23] relies on a clear spectral separation property but its implementation 
is computationally expensive, as the counts of self-avoiding walks it relies upon take super- 
linear (though polynomial) time. The method in [24] is computationally efficient as it runs in 
quasi-linear time, but the proof does not establish a spectral separation property. The other 
two methods conjectured to achieve successful reconstruction, namely belief propagation and 
analysis of non-backtracking spectrum, are computationally efficient and they are motivated by 
a clear intuition as described in the spectral redemption conjecture. 

Our present work proves the spectral redemption conjecture. More generally by characteriz¬ 
ing all the leading eigen-elements it determines the limits of community detection based on the 
non-backtracking spectrum in the presence of an arbitrary number of communities. 
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Ihara zeta function 

Hashimoto introduced the matrix B in the context of the Ihara zeta function [12]. We have the 
identity 

det{I- zB) = 

where Cg is the Ihara zeta function of the graph, refer to [28, 19, 16, 29]. It follows that the 
poles of the Ihara zeta function are the reciprocal of the eigenvalues of B. Our main results 
have thus consequences on the localization of the poles of the zeta function of random graphs. 

Weak Ramanujan property 

Our result also has an interpretation from the standpoint of Ramanujan graphs, introduced 
by Lubotzky et al. [22] (see Murty [26] for a recent survey). These are by definition fc-regular 
graphs such that the second largest modulus of its eigenvalues is at most 2\/k — 1. By a result 
of Alon and Boppana (see [27]) for fixed k, fc-regular graphs on n nodes must have their second 
largest eigenvalue at least 2\/k — 1 — o(l) as n ^ oo. Hence Ramanujan graphs are regular 
graphs with maximal spectral gap between the first and second eigenvalue moduli. A celebrated 
result of Friedman [9] states that random fc-regular graphs achieve this lower bound with high 
probability as their number of nodes n goes to infinity. 

Lubotzky [21] has proposed an extension of the definition of Ramanujan graphs to the non¬ 
regular case. Specifically, G is Ramanujan according to this definition if and only if 

maxjjAl : A G spectrum(A), jA] ^ p} < cr, 

where A is the adjacency matrix of G, p its spectral radius, and a the spectral radius of the 
adjacency operator on the universal covering tree of G. 

Using the analogy between the Ihara zeta function and the Riemann zeta function. Stark 
and Terras (see [16]) have defined a graph to satisfy the graph Riemann hypothesis if its non¬ 
backtracking matrix B has no eigenvalues A such that jA] G {^/pb, Pb), where pB is the Perron- 
Frobenius eigenvalue of B. Interestingly, a regular graph G is Ramanujan if and only if it satisfies 
the graph Riemann hypothesis (see [26] and [16]). Thus the graph Riemann hypothesis can 
also be seen as a generalization of the notion of Ramanujan graphs to the non-regular case, 
phrased in terms of non-backtracking spectra rather than on spectra of universal covers as in 
the definition of Lubotzky [21]. 

Our results imply that for fixed a > 1, Erdos-Renyi graphs ^(n, a/n) have an associated non¬ 
backtracking matrix B such that pB ^ ct and all its other eigenvalues A verify jA] < y/a + o(l) 
with high probability as n —>■ oo. In this sense, Erdos-Renyi graphs asymptotically satisfy the 
graph Riemann hypothesis, itself is a plausible extension of the notion of Ramanujan graphs to 
the non-regular case. This may be seen as an analogue of Friedman’s Theorem [9] in the context 
of Erdos-Renyi graphs. Similarly, for the Stochastic Block Model, our main result is analogue 
to recent results on the eigenvalues of random n-lifts of base graphs, see [10, 6]. Interestingly, in 
[6] , the methods developed in the present paper are adapted to lead to a new and simpler proof 
of Friedman’s Theorem and its extensions to random n-lifts. The random graphs studied here 
will require a more delicate analysis. 
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Organization 

The paper is organized as follows. We start in Section 2 with preliminaries on non-backtracking 
matrices. We state our main results in Section 3, namely Theorems 3 and 4 characterizing 
properties of non-backtracking spectra of Erdds-Renyi graphs and Stochastic Block Models 
respectively, and Theorem 5 establishing their consequence for community detection. 

In Section 4, we provide the algebraic ingredients we shall need. Specifically we establish 
general bounds on the perturbation of eigenvalues and eigenvectors of not necessarily symmetric 
matrices that elaborate on the Bauer-Fike Theorem. 

In Section 5 we give the proof architecture for Theorem 3 on the non-backtracking spectrum 
of Erdds-Renyi graphs, and detail a central argument of combinatorial nature, namely a rep¬ 
resentation of the non-backtracking matrix B raised to some power £ as a sum of products of 
matrices, elaborating on a technique introduced in [23]. 

In Section 6 we detail another combinatorial argument needed in our proof, namely we 
establish bounds on the norms of the various matrices involved in the previous matrix expansion. 
The latter norm bounds are established by the trace method, adapting arguments due to Fiiredi 
and Komlos [11]. 

Section 7 gives the proof strategy for Theorem 4 on non-backtracking spectra of Stochastic 
Block Models. 

In Section 8 we establish convergence results on multitype branching processes that extend 
results of Kesten and Stigum [18, 17]. These are then used in Section 9 to characterize the 
local structure of the random graphs under study. Specifically we relate the local statistics of 
Stochastic Block Models to branching processes via coupling, and then establish weak laws of 
large numbers on these local statistics. These probabilistic arguments together with the previous 
algebraic and combinatorial arguments allow us to conclude the proofs of Theorems 3 and 4. 
Section 11 contains the proof of Theorem 5 on community detection. 

2 Preliminaries on non-backtracking matrices 

In this section, we explain how the singular value decomposition of for £ large can be used 
to study the eigenvalues and eigenvectors of B. We also comment on analogs of some classical 
inequalities known for adjacency or Laplacian matrices of regular graphs. 

We set m = \E\. A priori, B is not a normal matrix. We are interested in its eigenvalues 
Xi{B),..., Xm{B) ordered non-increasingly, |Ai(R)| > ... > |Am(.B)|- The Perron-Frobenius 
Theorem implies notably that Xi{B) is a non-negative real. If G is connected, Xi{B) is equal 
to the growth rate of the universal cover of G (see Angel, Friedman and Hoory [1]). 

2.1 Oriented path symmetry 

An important remark is that despite B not being a normal matrix, it contains some symmetry. 
More precisely, we observe that {B*)ef — Bfe = B^-if-i. Introduce for all x G the notation 

Xe = Xg-i, e G E. (1) 
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It is then easy to check that for x,y G K'®' and integer k > 0, 

{y,B'‘x) = {B’^y,x). (2) 

In other words, if P denotes the involution on Px = i, we have for any integer k > 0, 

B^P = PB*’^. 


Hence B^P is a symmetric matrix (in mathematical physics, this type of symmetry is called 
PT-invariance, PT standing for parity-time). If {<Jj,k), 1 < J < w, are the eigenvalues of B^P 
and (xj^k), 1 < i < w, is an orthonormal basis of eigenvectors, we deduce that 

m 

B^ = J2<^xkXj,kxlk- ( 3 ) 

i=i 


We order the eigenvalues. 

From Perron-Frobenius theorem, Xi^k can be chosen to have non-negative entries. Since P is 
an orthogonal matrix, 1 < J < m, is also an orthonormal basis of M®. In particular, (3) 

gives the singular value decomposition of B^. Indeed, if sj^k = \<Xj,k\ and yj^k = sign{aj^k)xj^k, 
we get 

m 

='^Sj,kXj,kylk- ( 4 ) 

3 = 1 

This is precisely the singular value decomposition of B'^. 

For example, for A: = 1, it is a simple exercise to compute (crjp)i<j<m. We find that the 
eigenvalues of BP are (deg(u) — 1), 1 < u < n, and —I with multiplicity m — n. In particular, the 
singular values of B contain only information on the degree sequence of the underlying graph 
G. 

For large k however, we may expect that the decomposition (3) carries more structural 
information on the graph (this will be further discussed in Subsection 2.2 below). This will be 
the underlying principle in the proof of our main results. For the moment, we simply note the 
following. Assume that B is irreducible. From Perron-Frobenius theorem, if ^ is the Perron 
eigenvector of B, ||^|j = 1, then for any n fixed, 

Xi{B) = \iuY <7\^^ and lim - ^|| = 0. (5) 

k—¥oo ’ k—^oo 


A quantitative version of the above limits will be given in the forthcoming Proposition 7. An¬ 
other consequence of (3) is that, for i ^ Xi^k and Xj^k should be nearly orthogonal if these 
vectors converge as A: —>■ oo. Indeed, a heuristic computation gives 

^ \ _ {,B X^k^B Xj^k) _ {,B X'l^k^Xj^k) {B Xi^2k: Xj^2k} _ ^i,2k{Xi^2kiXj2k) _ ^ 

\^i,k’) ^j,k) — ’ ■ ~~ ~ ' ' — * ^ ' — ' ’ — 

^i,k^j,k ^i,k^ j ,k ^i,k^j,k ^i,k^j^k 


We will exploit this general phenomenon in the proof of our main results. 
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2.2 Chung, Cheeger and Alon-Boppana ineqnalities for non-backtracking 
matrices 

The aim of this subsection is to advocate the use of non-backtracking matrices. Here, we discuss 
briefly candidate counterparts for irregular graphs of inequalities that are classical in the context 
of regular graphs. This subsection will not be used in the proof of our main results, it may be 
skipped. 

The diameter bound of Chung [7] gives an upper bound on the diameter of a regular graph 
in terms of its spectral gap. The following lemma, expressed in terms of the decomposition (4) 
of is an analogue: 

Lemma 1. Let e = (u,u'), f = {v,v') G E be such that 

Then, the graph distance between u and v is at most fc -|- 1. 

Proof. Observe that if > 0 then u and v are at most at distance fc -|- 1. On the other 

hand from (4), 

ra 

i=2 

has absolute value at most S 2 ,fc from Cauchy-Schwartz inequality and the orthonormality of the 
Xj,k, j < 1 X 1 . Thus finally, > si^kXi,k{e)xi^k{f) - S 2 - □ 

Cheeger-type inequalities connect the expansion ratio (isoperimetry) of the graph and its 
spectral gap, for a survey see [14]. For a subset X C E of edges, we measure its volume by 

V,{X) = Y,xlk{e)- 

eGX 

By construction, our volume is normalized with Vk{E) = 1. We say that X C E is edge- 
symmetric if X = X. For example the set of edges adjacent to a given subset of vertices is 
edge-symmetric. If X, Y are edge-symmetric, we define 

Ek{X,Y)= Y. Xi^kie)x,,kif)B^f. 

e&XJ&Y 

Since B^j, is the number of non-backtracking walks of length fc -|- 1 starting with e and ending 
with /, Ek{X, Y) measures a kind of conductance between X and Y with a proximity range of 
radius fc -|- 1. If X'^ = X\E, the scalar 

YkiX) = YkiX^) = Ek{X,X^). 

can be thought of as the outer surface of a set X. The fc-th expansion ratio of G is then defined 

as , , 

. _ • Sfe(X) 

xctI=xl4(X)Al4(X-)- 

In (3), after reordering the eigenvalues of B'^P as ^ > o' 2 ^k > • ■ • > o'm,k, xri^k — <X 2 ,k plays 
the role of the spectral gap in the classical Cheeger inequality. With this new convention, the 
following lemma is the analog of the easy part of Cheeger’s inequality for graphs. 
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Lemma 2. 


— (72,k < 2/lfc. 


Proof. The argument is standard. For simplicity, we drop the index k. From Courant-Fisher 
min-max Theorem, we have 

{x,B^Px) {x,B^x) 

02 = max —n— ttt;— = max 


{x,xi)=o ||a;|| 

Let X G Ehe edge-symmetric. We set 


{x,xi)=o ||a:|| 


By construction {x,xi) = 0 and ||a:|p = \/V{X) -\- 1/V{X'^). Hence, 


^ V^) ^ B^^fx{e)x{f). 


However, using the edge-symmetry of X and 


V{X)V{X‘=)' 


Also, using the singular value equation B^xi = cricci, 

Y, By,{e)x^{f) = Y B':fX^ie)x^{f)-E{X)=a,ViX)-EiX), 

ejex eeX,feE 

and similarly for X‘^. So finally, 

2 


1 


1 


vix) h(a:^) 


(72 > 


f 1 


1 


Vh(a:) v{X‘() 


CTi - 


1 


1 


V{X) H(X=) 


E{X). 


Since, for x, a:' > 0, 1/a; -I- 1/a;' < 2/{x A x'), it concludes the proof. 


□ 


The Alon-Boppana theorem gives a lower bound on the second largest eigenvalue of the ad¬ 
jacency matrix of a regular graph (see [27, 14]). We conclude this paragraph with an elementary 
bound of this type. We introduce for e G E, 

Sk{e) = {S,,B'^x) = \\B'^PSe\\i- 


In words, Sk{e) is the number of non-backtracking walks of length A; -|- 1 starting with e. As 
already pointed, if B is irreducible, the Perron eigenvalue is the growth rate of the universal 
cover of the graph: for any e G E, 


Xi{B) = lim = lim 


k 

k—^oo ’ k—yoc 


We observe that 


Hence, we find. 


si.fc + (m- > tT{B^B*^) > Y Bkie). 


eGE 


si 


,fc> -5]5fc(e)-^. 
’ m m 


(6) 


eGE 


This last crude inequality gives a lower bound on the second largest singular value of B^. 
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3 Main results 


We now state our results on the non-backtracking spectra of Erdos-Renyi graphs first, and 
Stochastic Block Models next. 

3.1 Erdos-Renyi graphs 

Let the vector y on be defined as 

X(e) = 1, e e E. 

The Euclidiean norm of a vector x S will be denoted by ||x||. We have the following theorem. 

Theorem 3. Let G be an Erdos-Renyi graph with parameters (n, a/n) for some fixed parameter 
a > 1. Then, with probability tending to 1 as n ^ oo, the eigenvalues Xi{B) of its non¬ 
backtracking matrix B satisfy 

Ai(i?) = a-I-o(l) and |A2(i3)| <-s/o + o(l). 

Moreover the normalized Perron-Frobenius eigenvector associated to Xi{B) is asymptotically 
aligned with 

B^B*‘^X 

Wb^V 

where £ ^ Klog„ n for any 0 < k < 1/6. 

Theorem 3 is illustrated by Figure 3.1. We conjecture that the lower bound |A2(i3)| > 
^/a — o(l) holds, it is reasonable in view of Figure 3.1. We shall prove a weaker lower bound, 
see forthcoming Remark 12. It is also an interesting open problem to study the convergence of 
the empirical distribution of the eigenvalues of B. 



Figure 1: Left : eigenvalues of B for a realization of an Erdos-Renyi graph with parameters (n, a/n) 
with n = 500, a = 4. Right : eigenvalues of B for Example 2 with n = 500, r = 2, a = 7, 5 = 1. 





3.2 Stochastic Block Model 


For integer fc > 1, we set [k] = fc}. We consider a random graph G = {V,E) on the 

vertex set V = [n] defined as follows. Each vertex v G [n] is given a type crn(v) from the set [r] 
where the number of types r is assumed fixed and the map cr„ : [n] —>■ [r] is such that, for all 
i G [r], 

1 " 

7rn(j) := - l(crn(u) =i) = Tr{i) + o(l), (7) 

v — l 

for some probability vector tt = (7 r(l), • • • , 7r(r)). For ease of notation, we often write a in place 
of 

Given a symmetric matrix W G Mr(M+) we assume that there is an edge between vertices 
u and V independently with probability 

W{a{u),a{v)) ^ ^ 
n 

We set n = diag(7 r(l),..., 7r(r)) and introduce the mean progeny matrix M = UW (the branch¬ 
ing process terminology will be clear in Section 8). Note that the eigenvalues of M are the same 
as the ones of the symmetric matrix S = and in particular are real-valued. They 

are also the same as the ones of the expected adjacency matrix A := E(7l). We denote them by 
pLk and order them by their absolute value, 

l^rl < • • • < |M2| < Ml, 

We shall make the following assumptions: 

Ml > 1 and M is positively regular, (8) 

i.e. for some integer fc > 1, has positive coefficients. In particular, fii > max/j >2 |Mfc| is the 
Perron-Frobenius eigenvalue. It implies notably that for all i G [r], 7r(i) > 0. We define ro by 

Mfe > Ml for all fc € [ro] and Mro-i-i — /^i, 

(with p,r+i = 0). Since M = the matrix M is diagonalizable. Let {wiligp] be 

an orthonormal basis of eigenvectors of S such that Sui = piUi. Then (pi := and 

ipi = are the left and right-eigenvectors associated to eigenvalue mo 

Mipi = We get 


— ^ij: and, (^2, ^1)77 — ^ij: (fi) 

where {x,y)T^ = denotes the usual inner product in The following spectral 

decompositions will also be useful 

r r 

M = E 

k^l k^l 

where the second identity comes from 'ijjk = and W = 
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We will make the further assumption that each vertex type has the same asymptotic average 
degree a > 1, i.e., 

r r 

a = ^ Trii)Wij = ^ Mij for all j S [r]. ( 11 ) 

i=l i=l 

This entails that M*/a is a stochastic matrix and we then have 

fii = a > 1, (pi = 1 and ipi = n*. ( 12 ) 

We will also assume that a quantitative version of (7) holds, namely that for some 7 S (0,1], 

Ik - 7r„||oo = max|7r(0 - 7 r„(i)| = (13) 

iG[r] 

The random graph G is usually called the stochastic block model (SBM for short) or in¬ 
homogeneous random graph, see Bollobas, Janson and Riordan [5] and Holland, Laskey and 
Leinhardt [13]. A popular case is when the map cr is itself random and cr{v) are i.i.d. with 
distribution ( 7 r(l),--- , 7 r(r)). In this case, with probability one, condition (13) is met for any 
7 < 1/2. 

Example 1. If r = 2, then we have 7 r(l) = 1 — 7 r( 2 ). Under condition (11), we have W 22 = 
( 7 r(l)lTii -I- (1 — 27r(l))Wi2)/(l — 7 r(l)) so that fxi = a = 7 r(l)lTii -I- (1 — 7 r(l))Hi 2 and ^2 = 
7r(l)(Wii-Wi2). 

Example 2. If r > 2, Tr{i) = 1/r and Wu = a ^ b = Wij for all i j so that condition (11) is 
satisfied. We have = a = {a + {r — l)b)/r and fj ,2 = ■ ■ ■ = /J-r = [a — b)/r. 

For k G [r], we introduce the vector on 

Xfe(e) = </fc(cr(e 2 )) for all eG E. (14) 


In particular, xi = X- O^r main result is the following generalization of Theorem 3. 

Theorem 4. Let G be an SBM as above such that hypotheses (8,11,13) hold. Then with prob¬ 
ability tending to 1 as n ^ go, 


Afe(B) = -I- 0 ( 1 ) forkG[rff\, andfork>ro, |Afe(i?)| < a/o-I- o(l). 


Moreover, if pLk is a simple eigenvalue of M for some k G [rg], then a normalized eigenvector, 
say f,k, of Xk{B) is asymptotically aligned with 


B^B*^Xk 

\\B^B*^Xkr 


(15) 


where £ ~ ^^loga n for any 0 < k < 7 / 6 . Finally, the vectors fk of these simple eigenvalues are 
asymptotically orthogonal. 

It follows from this result that a non-trivial estimation of the node types a{v) is feasible 
on the basis of the eigenvectors {Cfe} 2 <fc<ro provided rg > 1. More precisely, for vertex type 
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estimators a{v) : [n] —>■ [r] based on the observed random graph G, following Decelle et al. [8], 
define the overlap ov((T, a) as the minimum over permutations p : [r] —>■ [r] of the quantity 


1 




/ , ^a(v)—poiT(v) max7r(/c). 
n —• fcG n 

V — 1 


(16) 


We shall say that a has asymptotic overlap S if ov((T, a) converges in probability to ^ as n grows. 
It has asymptotic positive overlap if for some (5 > 0, ov((T, a) > 6 with probability tending to 1 
as n grows. Note that an asymptotic overlap of zero is always achievable by assigning to each 
vertex the type k* that maximizes 7r(fc). In the case where all communities have asymptotically 
the same size, i.e. 7r(f) = 1/r, zero overlap is also achieved by assigning types at random. 

As conjectured in [8] and proven in [24], in the setup of Example 2 with r = 2, the best 
possible overlap is o(I) with high probability when rg = I, i.e. when fj ,2 < Conversely, 

adapting the argument in [23], when rg > I, we have the following 


Theorem 5. Let G be an SBM as above such that hypotheses (8,11,13) hold. Assume further 
that 7r(f) = I/r, that rg > I and that for some k € {2, ..., rg}, is a simple eigenvalue of M. 
Let fk & be a normalized eigenvector of B associated with Xk{B). 

Then, there exists a deterministic threshold t G K, a partition (/"'■, I~) of [r] and a random 
signing oj G { — I, l}'^ dependent of fk such that the following estimation procedure yields asymp¬ 
totically positive overlap: assign to each vertex v a label d'(v) picked uniformly at random from 
/+ if i-v{v)J 2 e-e 2 -v ^kie) ^ and from I~ otherwise. 

The reason for the existence of the signing w € { — 1,1}^ in the above statement is that 
we do not know a priori whether the vector fk or —fj- is asymptotically close to (15). In the 
simplest case, we will be able to estimate this sign and the vector oj will be equal to —1 or 1 and 
1+ = {iG [r] : > 0}, /- = [r]\I+. 


3.3 Notation 


We say that a sequence of events En holds with high probability, abbreviated w.h.p. , if lim„_>oo ^{En) 
1. The operator norm of a square matrix C is denoted by 


IIGII =sup 


II Ga 


c/o ||a:|| 


We denote by C* the transpose of G. 

Given a (non-oriented) graph G = {V, E), we denote by 7 = (70,..., 7fc) a walk of length k 
where each 7i S E and {7^,7^+!} G E for all i G {0, fc— I}. We also denote the concatenation of 
two walks 7 and 7' by (7,7'). A walk is non-backtracking if for all i G {0, ... ,k — 2}, 7^ 7i+2. 

A walk contains a cycle if there exists i ) with 7^ = 7^-. 


4 Algebraic tools: Perturbation of Eigenvalues and Eigen¬ 
vectors 

One main tool in our analysis is the Bauer-Fike Theorem. The form given below elaborates on 
the usual statement of the Theorem which in general omits the second half. 


II 



Theorem 6. (Bauer-Fike Theorem; see [4, Theorem VI.5.1]). Let D be a diagonalizable matrix 
such that for some invertible matrix V and diagonal matrix A one has D = V~^KV. Let E be 
a perturbation matrix. 

Then any eigenvalue p. of D + E verifies 

min - Ail < ||i;|| 

i 

where Xi is the i-th diagonal entry of A. 

Denote by R the right-hand side of (17) and Ci 
R. Let I be a set of indices such that 

(U.eiQ) f| (U,^iQ) = 0. (18) 

Then the number of eigenvalues of D E in UigxC'i is exactly |I|. 

The following proposition on perturbation of rank one matrices will be a basic ingredient 
to deduce from expressions like (3) quantitative versions of (5). It relies on the stability of 
eigenvalues and eigenvectors of matrices which are not too far from being normal {A is normal 
if and only if A*A = AA*; see [15]), and with a well separated spectrum. 

Proposition 7. Let A G Mn(R), T 1 he mutually prime integers, 9 G K\{0}, andco, ci > 0 
such that for any k G {£,£'}, for some Xk,yk G K”, Rk € Af„(K), 

A^ = e^xkvl + Rk, 


( 17 ) 

:= B{Xi, R) the ball centered at Xi with radius 


with {yk,Xk) > Co, ||a;fc||||yfc|| < ci and 


Pfell < 


2{£y£')ci 


\ 0 \^- 


Let {Xi), 1 < i < n be the eigenvalues of A, with |An| < ... < |Ai|. Then, Ai has multiplicity 
one and we have 

\Xi-0\<C\9\l£ and, for i> 2, |A,| < f \\Ri\\^/^, 


with C = 7r/2 + 2>/ci V 1 log(2(ci V Cg ^)). Moreover, there exists a unit eigenvector ip of A with 
eigenvalue Ai such that 


iP 


Xi 




Proof. We can assume without loss of generality that 9 = 1. We fix fc £ {£,£'} and let x = 
Xk/\\xk\\, y = yfe/||yfc||, <J = ||a::fc||||2/fc||, v = {yk,Xk). We have 


A'^ =axy* -GRk = S-G Rk. 


Our objective is to apply Bauer-Fike Theorem. To this end, we write S = UDU where 
D = diag(z/, 0,..., 0), U = (i, / 2 ,..., /„) with fi = y and (/*) i >2 is an orthogonal basis of y-^ 
(we will soon check that U is indeed invertible). We can also assume that x G span(/i,/ 2 ). 
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Obviously, the eigenvalues of coincide with the eigenvalues oi D + U ^RkU. We have to 
compute the condition number of U : 

k{U) = \\U\\\\U-^. 


We consider the unitary matrix V = (/i,..., /„). Let a, & G C be such that x = afi + bf 2 - We 
find 


V*U 


W 0 

0 Ln-2 


with 




In particular, 


k(17) = k{V*U) = k{W). 


As |ap + l&p = 1 we have 


W*W 



The eigenvalues of WW* are 1 ± |6|. We deduce that 


«([/) 



Now, by assumption, |6| = a/ 1 — |op = \JI — |(x,y)|^ < — CqC^^ < 1 — c§cj"^/2. We obtain 

that 

k{U) < k = 2cic^^. 

Notice that by assumptionn £\/ £' > 2 and 2K||i?fe|| < Cq < \v\. An application of the Theorem 6 
to H + U~^RkU then implies that there is a unique eigenvalue of in the ball {z G C : 
W — z\ < K||i?fc|j} and all the other eigenvalues lie in the disjoint domain {z G C : |z| < K||i?fc||}. 
Consequently, 

\X'l - v\ < K\\Rk\\ and, for j > 2, lAi]'" < K||i?fc||. (19) 

In particular, the eigenvalue Aj has multiplicity one in and thus Ai has multiplicity one in 
A. We now bound the difference between Ai and 0 = 1. First, by assumption. 


Cq < 1/ < Cl- 


( 20 ) 


From (19), we deduce that ||Aj| — i^| < co/2 and hence co/2 < |Aj| < 2ci. Since for all x G K, 
|e^ — 1| < |x|e'^+, we get 


- II < 


(2ci)i/'=-l V (co/2)i/'=-l <C2/k, 


with C 2 = a/(2ci) V llog(2(ci V C(C^)). 

We now control the argument w G (—7r,7r] of Ai = |Ai| exp(*a;). By assumption, v > cq > 0, 
hence by (19) the real part of Af is positive. There thus exist an integer q G Z and some 
e G (—1/2,1/2) such that 

ku} = 2qTT + STT. 

Since |x| < (7r/2)| sin(x)| for x G [—7r/2,7r/2], we obtain from (19) that 


n\\Rk\\ > lAil'^l sin(e7r)| > |Ai|'"2|e| > co|e|. 
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so that |£| < (/c/co)||i?fe|| = ciCq ^||i?/c||. The above holds for k G {£,£'}■ Hence, with the 
notation £ = e(£), s' = e{£'), q = q{£), q' = q{£') we have 


UJ = 


2qTT 


+ 


£7r 

T 


2g'7r ^ £'7r 


so that 


qi' - q'£ 


e'£ - e£' 
2 


Using the assumption ||i?/c|i < (^l{2ci{£y £')), we find that both e'£ and e£’ have modulus 
strictly less than 1 and hence so does the right hand side of the last display. It follows that 
q£' = q'£. Since £ and £' are mutually prime, we deduce that £ divides q and £' divides g', so 
that modulo 27r, w = £7r/^ = dTiji'. Thus for k G {£,£'}, 


|Ai - 1| < |Ai||e"^/'= - 1| + IIAil - 1| < (l + I) ^ 

As e < 1/k, the right-hand side of the above is no larger than A:“^(2c2-l-7r/2) if k > 2, which must 
hold for some k G {£,£'}■ This concludes the proof of the claim of Proposition 7 on eigenvalues. 

Consider now a normed eigenvector z of A associated with Ai, which for fixed k G {£,£'^ 
admits the orthogonal decomposition z = zq + where Zq G span(y/j, i^.). Applying A^, we 
obtain 

X\z = axkvlzo + RkZ. 

Projecting onto {yk,ik}'^ yields |Ai|^||z-*-|| < ||i?fc||. Using the bound |Ai|^ > co/2 gives 

\\z^\\<hRu\\. 

Co 

Projecting onto span(yfc, ife) yields 


IIAJ^zo - axkvlzoW < ||i?fc||. 


This entails 

< ^ II p II < ^ll^fcll 

- lAil'^lIzoll" ''"-colkoir 

where c is some scalar and we have used |Ai|^ > co/2. We now use the following general 
inequality 


zo 


Uoll 


- CXk 


if |ju|| = ||u|| = 1 and for some t > 0, ||u — tv\\ < e then ||u — u|| < 2£. (21) 


We deduce that 



Colkoll' 


From the trianle inequality, ||zo|| > 1 — ||z'*‘||. We then have 


Ik - ifcll < + Iko - Ikolkfcll +1 - Ikoll < 

Co 

Finally, since A = A* has multiplicity one in A^, the eigenspace of Ai for A coincides with the 
eigenspace of Af for A^. This concludes the proof of Proposition 7. □ 
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We now provide an extension of Proposition 7 to arbitrary rank tailored to our future needs. 
For X = {xi,... ,Xn) € C", the multiplicity of z G C in x is defined as fh® number 

of coordinates of x equal to z. 


Proposition 8. Let A G £'< £ < 2£' be mutually prime odd integers, 9 = {6i,... ,6^) G 

(K \ {0})’’ such that for any k G {£, £'}, for some vectors Xk,i,yk,i, ■ ■ ■, Xk,r, yk,r G K” and some 
matrix Rk G M„(]R), 

r 

A'^ = J2e’;xk,0l,+Rk. 

f=i 

Assume there exist co,Ci > 0 such that for all i ^ j G [r], {yk,j,Xkj) > cq, ||a;fcj||||yfej|| < ci, 
{^kj-,yk,i) — k ,j, ^ k — {yk,j,yk,i) — 0 and 


Pfcll < 


/ CoicoJ^ - Ci) + 




4ci 


A 


2{£v£')ci 


where £) = min^ \9i\, 7 = min{0i/dj ■. 9i > 9j > 0 or 9i < 0j < 0} (the minimum over the empty 
set being + 00 / Let (Ai), 1 <i <n, be the eigenvalues of A with |A„| < ... < |Ai|. Then, there 
exists a permutation a G Sr such that for all i G [r], 

|A^ - 6<^(i)| < and, for i>r+ 1, |A,| < \\Ri\\^^\ 

with C = 7r/2 + 2^/ci V 1 log(2(ci V Cq ^))- Moreover, if 9,r{i) has multiplicity one in 9, Xi is a 
simple eigenvalue and there exists a unit eigenvector ifi of A with eigenvalue Xi such that 


with C = 24ciCq ^/(l A (co 7 *^ — ci)+ A cq). 




Proof. We may assume that = 1. Fix k G {£,£'} and let Xj = Xkj/\\xk,j\\, yj = yk,j/\\yk,j\\, 
= 0j\\xk,j\\\\yk,j\\, k'j = 0j{yk,j,Xk,j) = (7j{yk,j,Xkj). Let Hj = spa.n{xkj,yk,j)- By assump¬ 
tion, the vector spaces Hj, 1 < j < n, are orthogonal. For ease of notation, let us assume for all 
j G [r], Hj has dimension 2 (the case where Xkj and yk,j are colinear is identical). We consider 
an orthonormal basis (/i, • • • , /„) of C", such that span(/ 2 j_i, / 2 j) = Hj, f 2 j-i = yj- We have 

r 

A'^ = Y, +Rk = UDU-^ + Rk, 

i=i 


where D = diag(z/i, 0, ^ 2 ,0,..., 0,..., 0), U = (ii,/ 2 , ^ 2 ,/ 4 , ■ • ■, Sr,/ 2 r, • • ■,/n) (provided 

that U is indeed invertible). Arguing as in the proof of Proposition 7, denote by V the unitary 
matrix V = {fi, ■ ■ ■, fn) and decompose Xj as Xj = Ojf^j-i + bjf 2 j. Then V*U has a block 
diagonal structure with blocks Wj, 1 < j <r, and In- 2 r, where 



We find, as in Proposition 7, 


k{U) = maxK(kF,) < k = 2ciCq 
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Now, by assumption, 2«;||i?fc|| < cq A (co 7 ^ — ci)+ is less than the minimal distance between the 
distinct eigenvalues of D. We deduce from Theorem 6 applied to D -\- U~^RkU that there is a 
permutation s G Sr such that 

\>^i - < K\\Rk\\ and, for i > r + 1, \X^\^ < K\\Rk\\. (22) 

Importantly, we claim that the permutation s = Sk G Sr is such that for 1 < i < r, 
Indeed, we first observe that the assumptions y^co > ci and k odd imply that 
Vi = Vj is equivalent to 9i = 9j. Assume first for simplicity that all 6i are positive and let mi be 
the multiplicity in 0 of ti = max^ 9i. Then the mi eigenvalues such that |Af — Vj\ < K||-Rfc|| for 
some j such that 9j = ti are precisely the mi largest eigenvalues of A. If mi < r, we may then 
repeat the same argument for the second largest value of the set {9i, • • • , 0r}. By iteration, we 
deduce the claimed statement when all 9i have the same sign. In the general case, we notice 
that if |Af —Vj\ < £ = 9j > 0, then Vj > cq, | sin(Af )| < e/co, | arg(Af )| < 7r£/(2co) 

and |arg(Af)| < -Kelji^CQa) < ■n/2 (we use here the assumption (. < 21"). It follows that if 
|Af — i^j \ < 9j > 0 then we cannot have |Af — Vji \ < K||i?^|| with 9j' < 0. We may 

thus repeat the previous argument by considering the largest eigenvalues of A with positive real 
part and the largest eigenvalues of A with negative real part separately. 

We now bound the difference between Ai and 9si^i). By assumption, Co|dj| < \vj\ < Ci|dj|. 
Hence, arguing as in the proof of Proposition 7, 

l^s(i)l| — £2\9s(^i)\/k^ 

with C 2 = \/ci V 1 log(2(ci V Cf)"^)). We next control the argument uji G of Xi = 

|Ai|sign(0cr(i))e*‘^. Arguing as in the proof of Proposition 7, we get for p GZ, |a; — 2p7r| < Tr\£\/k 
and we may conclude the proof of the claim of Proposition 8 on eigenvalues as in Proposition 7. 

It now remains to control the eigenvector of A^ such that 0CT(i) has multiplicity one. First 
from (22), Xi is a simple eigenvalue of A. Let 2 ; be a corresponding normed eigenvector of A. 
Applying yields 

XiZ = ^ aj{y*z)xj + RkZ. (23) 

jelr] 

Applying once more to (23) yields 

Af 2 ; = X^RkZ+Y^ aj[aj{y*z){y*Xj) + y*Rkz\x,. 
jeH 

Multiplying (23) by X^ and subracting it to the previous display yields 

^ crj [(Af - Vj)y*z - y*Rkz]xj = 0. 
jeH 

Thus for all j G [r], 

iX^-v,)rjZ-r^RkZ = 0. (24) 

Now for j ^ s{i), from (22), we have 

\Xi - Vj\ > - Vj\ - \X^ - > ^((coy'" - ci)+ A cq). 
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It follows that 


1/2 


E \y. 




< 


2||i?fc|| 


(co7'=-Ci)+Aco' 


Moreover, this implies upon dividing (24) by 


1/2 


1/2 


E 

\3¥^s{i) 


1 ~:^c 


E 


? 




2||i?fc|| 


Co V(co7''- Ci)+A Co Co 


+ -Pfc|| , 


where we have used the fact that |A^| > co/2. It then follows from (23) that for some constant 

\ k 


c = CTs(,)(y*(^)z)/Af, 

'^0 V 


2|li?fc|| 




(007^= - Cl)+ A Co Co 


+ -Pfc|| ■ 


We then obtain the announced bound on \\z — is(i)|| by appealing to (21). 


□ 


We conclude this paragraph with an elementary lemma on Gram-Schmidt orthonormalization 
process. It will be used to obtain vectors which are exactly orthogonal as in the assumptions of 
Proposition 8. 

Lemma 9. Let iti, • • • , U/t be vectors in C" with unit norms such that \ {ui^ Uj) \ < 5 for all i j ■ 
If S < k~^ then (iti, • • • , Uk) are linearly independent and, if (ui, • • • , Uk) is the Gram-Schmidt 
orthonormalization process of (ui, • • • ,Uk), we have for all j G [k], 


\\uj-uj\\ < 6f. 


Proof. We prove the statement by induction. For fc = 1, ui = ui. For fc > 1, we denote by Vk+i 
the orthonormal projection of Uk+i on the span of (ui, • • • ,Uk). We have 

Ikfc+lf = 

i=i 

Now, from the induction hypothesis, 

k k 

\{uk+i,Uj)\^ < 2Y {\{uk+i,Uj)\^ + \\uj - Ujf) < 2M2(1 + fc2'=). 
i=i 1=1 

It is easy to check that •\/2fc(l + k^) < 2~^(k + 1)^+^ for all A: > 1. In particular, if <5 < 
{k + Vk+i Ufc+i and then from (21), ||ufe+i - Ufe+iH < 2||ufe+i|| < 5{k + l)'=+b □ 


5 Erdos-Renyi graph: proof strategy for Theorem 3 

In what follows, we consider a sequence £ = £(n) ~ Klog^ n for some k G (0,1/6) as in Theorem 

3. 
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5.1 Proof of Theorem 3 


Let 


and 




c = 




\\B‘B*^Xr 

(if 6* = 0, we set ^ = 0). The proof relies on the following two propositions. 

Proposition 10. For some Ci,Co > 0, w.h.p. 

(Cl f) ^ Co and < cio^. 

Proposition 11. For some c > 0, w.h.p. 

sup ll-B^a;|| < (logn)‘^Q;^/^. 

0 ,|| 3 ;|| — 1 

Let us check that the last two propositions 10 and 11 imply Theorem 3 . Let R = B^ — 9(^(p* 
and y G with ||j/|| = 1. We write y = s(p + x with x G (p'^ and s S K. We find 


\\Ry\\ = \\B‘^x + s{B^(p-9C,)\\< sup 

a::(ic,(^)=0,||ai||=l 


\B^x\\. 


Hence, Proposition 11 implies that w.h.p. 

\\R\\ < (logn)'^a^/^. 

We may now apply Proposition 7. If Xi = Xi{B)., we find that w.h.p. 
|Ai-a| =0(l/f), IA 2 I < (C{\0gnfa^/^y = + 


(25) 


and the normalized Perron eigenvector ^ of H satisfies w.h.p. 

lie - Cll = 0((logn)^a"^/^). 


This concludes the proof of Theorem 3. 

Remark 12. We note that from [15, Theorem 3.3.16], we get that in (4), 

|si.^ - 0| < I1R|| and S 2 ,^<||i?|l. 

Hence, from (25) w.h.p. 

sij = 0{a^) and S 2 ,e = 0((logn)°a^^^) 

On the other hand, (6) implies that the above upper bound on S 2 .t is also a lower bound up to 
the logarithmic factors, more precisely, w.h.p. , S 2 ,i > Cqu^^"^ for some cq > 0 (it follows from 
the proof of the forthcoming Theorem 37). Therefore, the naive lower bound on S 2 ^i in (6) is 
asymptotically tight and Propositions 10-11 may be interpreted as a weak Ramanujan property 
for Erdos-Renyi random graphs. 
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Proposition 10 will follow from a local analysis. Namely the statistics of node neighborhoods 
up to distance £ in the original random graph will be related by coupling to a Gallon-Watson 
branching process; relevant properties of the corresponding Gallon-Watson process will be estab¬ 
lished; finally we shall deduce weak laws of large numbers for the ^-neighborhoods of the random 
graph from the estimations performed on the branching process combined with some asymptotic 
decorrelation property between distinct node neighborhoods. This is done in Section 9 where 
Proposition 19, which contains Proposition 10, is proven. 

The proof of Proposition 11 relies crucially on a matrix expansion given in Proposition 13, 
which extends the argument introduced in [23] for matrices counting self-avoiding walks to the 
present setup where non-backtracking walks instead are considered. We now introduce some 
notation to state it. 

5.2 Matrix expansion for 

For convenience we extend matrix B and vector y to where E{V) = {(it, v) : u ^ v 

is the set of directed edges of the complete graph. We set for all e, / S x(e) = 1 and 

Bef = ^e^/l(e2 = /i)l(ei ^ /2), 

where A is the graph’s adjacency matrix. For integer k > 1, e, f G E{V), we define F*^ as the 
set of non-backtracking walks 7 = ( 70 ,..., 7 fc) of length k starting from ( 70 , 71 ) = e and ending 
at ( 7 fe_i, 7 fc) = / in the complete graph on the vertex set V. We have that 

^gpfc+ls =0 

We associate to each walk 7 = (jo, ■ ■ ■ , 7 k), a graph 0 ( 7 ) = (V( 7 ),E(-j)) with vertex set 
^( 7 ) = {7i, 0 < 1 < fc} and edge set if ( 7 ) the set of distinct visited edges ( 7 ^, 7 i+i}, 0 < i < fc— 1 . 
Following [24], we say that a graph ii is tangle-free (or i-tangle free to make the dependence 
in i explicit) if every neighborhood of radius £ in ii contains at most one cycle. Otherwise, 
H is said to be tangled. We say that 7 is tangle-free or tangled if 0 ( 7 ) is. Obviously, if G is 
tangle-free and 1 < fc < £ then = B^'^\ where 

^gp,fc+i s=0 

and is the subset of tangle-free paths in F^^^. For u d, we set 


We define similarly the matrix on 

76-P’e/" s=0 
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The matrix can be thought of as an attempt to center the non-backtracking matrix 
when the underlying graph is tangle-free. We use the convention that a product over an empty 
set is equal to 1. We also set 

A® = l(e = f)A, and = l(e = f)A,. (26) 

Notably, is the projection on E. We have the following telescopic sum decomposition. 


B 


(t) 

ef 


A 


W 

ef 


t-1 


t=0 


Y. n 


^7s7s + i ■ 


7eF^^+i s^O 




(27) 


Indeed, 


e i e t-i i 

-l-^I|ys(a;t-2/t) x^. 

s=0 s=0 t—0 s—0 


We denote by K the non-backtracking matrix of the complete graph on V. For 0 < < < £, we 
define via 

t-i t 

(-^t ^)e/ = ^ IT—7s7s+l IT ^7.7s+l> 

S—0 S— 1+1 

where for 1 < t < £ — 1 , C is the set of non-backtracking tangled paths 7 = 

(70,..., 7 ^-hi) = (Y, 7") G rf/^ with 7' = (70,..., 7 t) G Fig, 7" = (7t+i,..., -fi+i) G F^g 7 f for 
some g,g' G E{V). For t = 0 , is the set of non-backtracking tangled paths 7 = (7', 7") 
with 7' = ei, 7" = (71,... ,7^+i) G F^,^ for some g' G E{V) (necessarily gl = 62). Similarly, 
for t = Fl+j is the set of non-backtracking tangled paths 7 = (70,... = (7^7") with 

7" = h, i = (70,... ,70 G Fig for some g G E{V) (necessarily 52 = /i)- 
We define 


L = K^- XX* 


{L is nearly the orthogonal projection of on y*"). We further denote for 1 < t < — 1 


We then have 

Proposition 13. With the above notations matrix bF") admits the following expansion 

n n f ^ n n 






If G is tangle-free, for any normed vector x G one has 


\B^x\\ < ||AW|| +Y||i^B(^-i)|| + 

n n 

+ 5|:i|s!'>||+o||A<'-‘>|| + 2^||i!!'>||. 

n t ^ n < ^ 


(28) 


(29) 

Proof. Equation (28) readily follows by adding and subtracting to the t-th term of the 

summation in (27) and noticing that this term plus (^i?|^Yactorizes into a matrix product. 
Inequality (29) follows from (28) by noting that B^ = B^^'> as G is tangle-free, decomposing 
into L XX* 1 finally using the fact that |jiG|| < n. □ 
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5.3 Norm bounds 

The following proposition will be established in Section 6 using path counting combinatorial 
arguments. 

Proposition 14. Let I ^ Klog^, n with k G (0,1/6). With high probability, the following norm 
bounds hold for all k, 0 < k < £: 

l!A('=)|| < (logn)i°a'=/2^ (30) 

< (logn)®a^/^v^, (31) 

< (logn)25a^-'=/2, (32) 

11^^11 < (logn)i°a'= and \\K\\ < ^{\og, (33) 

and the following bound holds for all k, l<k<£—l: 

ll^fll < v^(logn)2V-'=A. (34) 

5.4 Proof of Proposition 11 

Together with Propositions 13 and 14, we shall also need the next two results, established by 
local analysis in Section 9. In particular the forthcoming Lemma 30 implies that 

Lemma 15. For £ ~ ^logo, n with k < 1/2, w.h.p. the random graph G is i-tangle-free. 

For the Erdds-Renyi graph. Corollary 34 states the following. 

Proposition 16. For I ^ ^logo, n with k < 1/2, w.h.p. , for any 0 < t < £ — 1, it holds that 

sup \{B*x,x) \ < (logn)^n^^^a*^^. 

\\x\\ = l,{B^X,x)=0 

We now have all the ingredients necessary to prove Proposition 11. In view of Lemma 15, 
we may use the bound (29) of Proposition 13 and take the supremum over of all x, ||a;|| = 1, 
{(f,x) = {x,B^x) = 0. By the norm bounds (30)-(32)-(34) of Proposition 14, w.h.p. 

q,||A(^-i)|| + IIA^^^Il + - ^ llRf^ll + IlS'f^ll < C(logn)'=a^/^(l + = 0((logn)V^^). 

Also, from (2), since x = X; 


sup 

(x,B(‘)x) 

= sup 


= sup 

{B^*^X,x) 

\\x\\^l,{x.B^x)=^0 




\\x\\=^l,{B^X.x)^0 



Hence, from Proposition 16 and norm bound (31), w.h.p. 

I!A(‘-i)x|||(x,b(^-‘-i)x)| < C(logn)W /2 

Hence, w.h.p. 

^ ^ II A(‘-i) III (x, I = o((log . 

It remains to use norm bound ((33) to deal with the term ||iFi3(^“^)||/n in (29) to conclude the 
proof of Proposition 11. 
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6 Proof of Proposition 14: path count combinatorics 


In this section, we use the method of moments to prove the norm upper bounds stated in 
Proposition 14. Recall that £ ~ k log„ n with k > 0. All our constants will depend implicitly 
on K. 


6.1 Bound (30) on ||A(*^)|| 

The proof will use a version of the trace method. For u > 3, we set 

logn 


(35) 


131og(logn 

The symmetry (2) implies that With the convention that e 2 m+i = ei) we get 




- 1)1 


)e2i + i,e2i 






- 1)1 


2m k 

E nri^7..-. 

7^^/e,Tn ^ = 1 S = 1 


7i,s 


(36) 


where IFfc^m is the set of sequence of paths 7 = ( 71 ,..., 72 m) such that 7 ^ = ( 7 ^, 0 ; •' • , 7i,fc) € 
yfe+i jg non-backtracking tangle-free of length k and for all i = 1 ,..., 2 m, 




with the convention that 70 = 72 m- 

We take expectations in (36) and use independence of the edges A^y together with = 0. 
We find 

2m k 

E||A(fc-i)f- < ^ Enn^7..-17.. (37) 

i=l s=l 

where W'^ ^ is the subset of Wk^m where each non-oriented edge is visited at least twice. For 
each 7 £ Wk,m we associate the graph Gij) = ( 1 ^( 7 ), £’( 7 )) of visited vertices and edges. We 
set 

1 ^( 7 ) = 1 ^( 7 )! and e( 7 ) = |£( 7 )|. 

We say that a path 7 is canonical if 1 ^( 7 ) = {1, • • • , ^^( 7 )} and the vertices are first visited in 
order. Wk,m will denote the set of canonical paths in Wk,m- Every canonical path is isomorphic 
to paths in Wk,m- We also have the following 

Lemma 17 (Enumeration of canonical paths). Let yVk,m{v, e) be the set of canonical paths with 
vfy) = V and 6 ( 7 ) = e. We have 

|Wfc.™(«,e)| < fc2-(2A:m)6™(--"+i). 
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Proof. In order to upper bound \Wk,miv, e)\ we need to find an injective way to encode the 
canonical paths x G yVk,miv,e)- 

Let X = {xi^t)i<i< 2 m,o<t<k G Wk,miv,e). We set Vi.t will be called 

an edge of x. We explore the sequence {xi^t) in lexicographic order denoted by ^ (that is 
(i, t) ^ (1 + 1, t') and (i, t) < {i,t + 1)). We think of the index (i, t) as a time. For 0 < t < fc — 1, 
we say that (z,t) is a first time, if xi^t+i has not been seen before (that is Xi^t+i 7 ^ Xi'^t' for all 
{i',t') ^ (*,t)). If (*,t) is a first time the edge is called a tree edge. By construction, the set 
of tree edges forms a tree T with vertex set {1,..., The edges which are not in T are called 
the excess edges of x. Any vertex different from 1 has its associated tree edge. It follows that 
the cardinal of excess edges ise = e — u + 1. 

We build a first encoding of e). If {i,t) is not a first time, we say that {i,t) is an 

important time and we mark the time {i,f) by the vector {xi^t+hXi^r)-, where (*,t) is the next 
time that yi^^ will not be a tree edge (by convention r = fc if Xi^s remains on the tree for all 
t + 1 < s < fc). Since there is a unique non-backtracking path between two vertices of a tree, 
we can reconstruct x G Vdk.m from the position of the important times and their mark. It gives 
rise to our first encoding. 

The main issue with this encoding is that the number of important times could be large. 
We have however not used so far the hypothesis that each path Xi is tangle free. To this end, 
we are going to partition important times into three categories, short cycling, long cycling and 
superfluous times. First consider the case where the i-th path Xi contains a cycle. For each i, 
the first time {i, f) such that Xi^t+i € {xip,..., Xi^t\ is called a short cycling time. Let 0 < a <t 
be such that Xi^t+i = By the assumption of tangle-freeness, C := {xi^a, ■ ■ ■ ,Xi^t+i) is the 
only cycle visited by Xi. We denote by {i,T) the first time after {i,f) that in not an edge 
of C (by convention r = fc if Xi remains on C). We add the extra mark r to the short cycling 
time. Important times {i, t) with l<t<a or T<t<k are called long cycling times. The 
other important times are called superfluous. The key observation is that for each I < i < 2m, 
the number of long cycling times {i,t) is bounded by e — 1 (since there is at most one cycle, 
no edge of x can be seen twice outside those of C, the —1 coming from the fact that the short 
cycling time is an excess edge). Now consider the case where the i-ih. path does not contain a 
cycle, then all important times are called long cycling times and their number is bounded by e. 

We now have our second encoding. We can reconstruct x from the positions of the long 
cycling and the short cycling times and their marks. For each 1 < i < 2m, there are at most 
1 short cycling time and e — I long cycling times within Xi if Xi contains a cycle and 0 short 
cycling time and e long cycling times if Xi does not contain a cycle. There are at most 
ways to position them (in time). There are at most v'^ different possible marks for a long cycling 
time and v^k possible marks for a short cycling time. We deduce that 

\Wk,m{v,e)\ < 

We use V < 2km to obtain the announced bound. □ 

Proof of Proposition 14, norm bound (30). From (37) and Markov inequality, it suffices to prove 
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that 


2m k 




16m km 

a 


(38) 


Observe that if 7 S "^(t) ~ 1 ^ ^(7) < fcm and v{'j) > 3 . As („("))^'(7)! < any 

7 S Wfe,m is isomorphic to less that elements in Wk,m- Also, from the independence of the 
edges and < afn for integer p> 2 , we get that 


2m k 




e(7) 


2=1S=1 

Hence, using Lemma 17 , we obtain for k < 

fem+1 km 


v—e—1 


S < Y. T. |Wfc,„Ke)|(^)V 

v—3 e—v—1 

/cm+1 km 

< na^'^ Y 

V—3 e—v—1 

< (£m)Y( 

s=0 ^ 

For our choice of m in ( 35 ), we have, for n large enough, 

=o(logn)7 £m = o(logn)2 and (2£m)®’" < ni2/i3_ 
In particular, the above geometric series converges and ( 38 ) follows. 


( 39 ) 


( 40 ) 


□ 


6.2 Bound (31) on 

Proof. The bound ( 31 ) on ||A^^^x|| we will now establish improves by a factor y^n on the trivial 
estimate < ||x||||A(^^||. Its proof parallels the argument used to show ( 30 ). We have 

E||A('=-i)xf = E ^ 

e,f,g 

76W"i i=l s=l 

where lU^i i® pairs of paths (71,72) such 7^ = (770; ■ • ■; 7i,fe) is non-backtracking 

and (7i,fc-i, 7 i,fc) = (72,1,72,0) and each edge is visited at least twice. The only difference with 
Wk,i defined above is that we do not require that (71,0,71,1) = (72,*,72,fe-i)- However, this 
last condition (71.0,71.1) = (72m,fc, 72m,fc-i) was not used in the proof of Lemma 17 . It follows 
that the set of canonical paths in Wj! with v distinct vertices and e distinct edges has cardinal 
bounded by A;^( 2 fc)®(®“'"+^i. Since the paths are connected and each edge appears at least twice, 
we have u — 1 < e < fc. As in the proof of ( 30 ), we get from ( 40 ) with m = 1 

E||A('=-i)xf < Cna'^(logn)^. 

We conclude with Markov inequality and the union bound. □ 


24 



6.3 Bound (32) on ||-R[^^|| 

For n > 3 , we set 

logn 

7TL — - 

L 251 og(logn)_ 

For 0 < k < i — we have that 

= E n 

'y^'^e,rr,,k ®=1 s=k+2 

where is the set of sequence of paths 7 = (71,..., 72m) such that = (7^,0, • • • , 7 i,k) 

and 7^ = {ji^k+i,’ ■ ■ are non-backtracking tangle-free, 7i = (jljjf) is non-backtracking 

tangled and for all odd i £ 2m}, 

( 7 i,o, 7 i.i) = ( 7 i-i,o, 7 i-i.i) and (7i7-i,7i7) = ( 7 i-Hi 7 -i) 7 i-Hi 7 )> 
with the convention that 70 = 72m ■ 

We define G{j) = (1^(7), E{j)) as the union of the graph G(7f), 1 < i < 2 m, z € { 1 , 2 }. Note 
that the edges (7i,fc,7i,fc+i) are not taken into account in G{j). As usual, we set ^(7) = |T^(7)| 
and 6(7) = |A(7)| > r'(7). Since 7^ is tangled either (a) G{ji) contains a cycle and is connected 
or (b) both G{jI) and G{'jf) contain a cycle. In particular, all connected components of 0(7) 
contain a cycle and it follows that 

v{j) < e(7). 

Taking the expectation in ( 42 ), we find that 

2m k I 

^ n (43) 

i—1 s—1 s—k-\-2 

where T£^m,k is the subset of 7 e ^ such that 

■^(7) < 0(7) < km + 2 m(£ — 1 — k) = m{ 2 £ —2 — k). ( 44 ) 

Indeed, for the contribution of a given 7 in ( 43 ) to be non-zero, each pair {7i^s_i,7i_s}, 1 <i < 
2 m, 1 < s < A:, should appear at least twice in the sequence of the 2 {£— I)m pairs {7i,s-i) 7i,s}) 
s ^ A; -I- 1 . 

Lemma 18 (Enumeration of canonical tangled paths). Let 7 },m,fc(r’, e) &e the set of canonical 
paths in Ti^rn,k with v(j) = v and e{j) = e. We have 

\n^Av,e)\ < ( 4 £m)i 2 ™(--+i)+ 8 -. 

Proof. We will adapt the proof of Lemma 17 and use the same terminology. We start by 
reordering 7 S 7 },m,fc into a new sequence which preserves as much as possible the connectivity of 
the path. First, we reorder 7 = (71,...,72m) into 7 = (xi ,..., X2m) by setting for i odd, 7 = 7 i 
and for i even, 7 ^^^ = Also, for i odd, we set ki = k and for i even ki = £ — k — 1 . Finally, 


(41) 


(42) 
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we write 7 * = ( 7 ', 7 ") with 7 ' = ( 7 ^, 0 , • • • ,7z.feJ and 7 " = ( 7 j,fc,+i,.. -ni/)- To each i, we say 
that 7 i is connected or disconnected whether G{j") intersects the graph Hi = Uj<iG( 7 j)UG( 7 -) 
or not. If 7i is disconnected, we define for 0 < < < £, Xi^t = li,t- If 7 i is connected, for 0 < t < ki, 
we set Xi^t = 7 i,t; and if qi > ki is the first time such that 7 i_q. G Hi, we set for + 1 < t < 

Xi,t = 'Iqi+ki+i-t and for Qi + 1 < t < £, Xi^t = li,t- We then explore the sequence (xi^t) in 
lexicographic order and set yi^t = Xi^t+i\- The definition of first time, tree edge and excess 
edge carry over, that is {i,t) 7 ^ (i,ki) is a first time if the end vertex of Xi^t+i has not been 
seen before. When 7 ^ is connected, we add the extra mark {qi,^qi), if 7 i is disconnected this 
extra mark is set to 0. With our ordering, all vertices of I^(7)\{1} will have an associated tree 
edge, at the exception of Xi^ki+i when 7 ^ is disconnected. If 5 is the number of disconnected 
7i’s, we deduce that there are 5 + e + v — 1 excess edges. Note however that there are at most 
e = e + u — 1 excess edges in each connected component of G( 7 ). 

We may now repeat the proof of Lemma 17. The main difference is that, for each i, we use 
that 7' and 7" are tangle free, it gives short cycling times and long cycling times for both 7' and 
7". For each i, there are at most 2 short cycling times and 2(e — 1) long cycling times. Since 
there are at most ways to position these cycling times, we arrive at 

\Tl,mAx,e))\ < 

where the factor (2t'u)^™ accounts for the extra mark. Using v < 2£m, we obtain the claimed 
statement. □ 


Proof of bound (32). From (43), it suffices to prove that 

2m k t 


< (G log 71)24"*0,(2^-'=)™. (45) 


■yeTt^rn.k i=lS=l 

As in (39), we find 

2m k 


s—k-\-2 


mm .,.-.-,., n < (^)“”. 


i—1 s—1 s—k-\-2 

From (44) and Lemma 18, we obtain 

m{2l-2-k) m{2l-2-k) 


S < 


Y l'^.m.fc(u,e)|(^) 77- 


v—1 e—v 

m{2i—2—k) 00 

< ^ (4.^777) 12m(e-t))+20m^«-e 


V—1 e—v 


< a(2^-'^)’"(4£777)20™(2£777)y^ 


s=0 


(4^r 


(46) 


For our choice of m in (41), we have £m = o(}ogn)^ and (d^m)^^"* < and (45) follows. □ 


6.4 Bound (33) on 

Since ||Ar|| is of order n, we observe that the second statement in (33) improves by a factor y/ri 
the crude bound \\KB^'^'>\\ < ||ATlj||. 
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Proof. We only prove the second statement. The first statement is proved similarly. The 
argument again parallels that used to show (30). We take m as in (35). We have 

m k k—1 

~ 'y ' ^72i,<,-l72i,a (^7) 

7^2—1 s—2 s—1 

where Wk^m is the set of sequence of paths 7 = (71,..., 72m) defined below (36). From (47) and 
Markov inequality, it suffices to prove that 

m k k — 1 

E EH n ^72.-1.-172.-1.11 "^72.,a-i72..a log( 48) 

-yeWk.rrt i-1 S=2 S=1 

If 7 G yVk,m is a canonical element of Wk^m then ^(7) — 1 < 6(7) < 2 km and ^(7) > 3. Also, 
any 7 S Wk,m is isomorphic to less that elements in Wk,m- Moreover, we have that 


m K K—L 

E J_ J_ J_ J_ 24-),2i-i,a-i72i-l. I (49) 


indeed, for any p > 1, EAg „ < a/n and, since (721+1,0,721+1,1) = (72i,fe, 72i,fc-i) at most m 
distinct edges are covered by the union of {721-1,0,721-1,1} and {72i,fc-i,72i,fc}- Hence, using 
Lemma 17, we obtain 


km-\-l 2km 


S S ( = ) E E m„(,.e)\Q n^ 

v—3 e—v—1 

fcm+1 2km 

V—3 e—v — 1 

"+la(2'=-l)™(£)2’"(^TO)£ 


< n" 


s =0 


For our choice of m in (35), the above geometric series converges and (48) follows. □ 


6.5 Bound (34) on ||S'f^|| 

Proof. Observe that L^f = 0 unless e = /, K^f = 1, Kj-i^ = 1 or K^f-i = 1 in which cases 
Lef = — 1. We may thus decompose 

L = -I-K', 

where I is the identity, and the non-zero entries of K' are equal 1 and are the pairs (e, /) such 
that Kgf = 1, ATj-ig = 1 or K^f-i = 1. Thus 

Bounds (30)-(33) imply that the first term has a smaller order than the intended bound (34). 
Hence we only need to bound the last term. We use again the method of moments. We observe 
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that K'^j < Kef + {PK)ef + {KP)ef. A straightforward adaptation of the proof of bound (33) 
shows that w.h.p. , for any 1 < fc < £ — 1 , 

< v^(logn)i°a'=/^ 


which concludes the proof. 


□ 


7 Stochastic Block Model : proof of Theorem 4 


In this section, we give the strategy of proof for Theorem 4. Let £ = £{n) ~ «:log„ n for some 
K S ( 0 , 7 / 6 ) as in Theorem 4. Recalling the definition (14) of vector Xk, "we further introduce 
for all fee [r]: 


B^Xk 

WxkW 


Ok = IlsVfcll, 


(50) 


and 

B^g>k B^B*^Xk 

Ok \\B^B*(xk\\' 

(in the above, if O/, = 0 , we set Cfe = 0 )- We also define 


H = span((/fc, k e [r]). 


We then have the following 
Proposition 19. For some b,c > 0, w.h.p. 

(i) b\p,l\ < < c\fj.i\ ifkG [ro], 

(a) sigB.{pl){Ck,<pk) >bifkG [ro], 

(Hi) 0k < (log n) ^ 0^/2 g [r]\[ro], 

(iv) \{(pj,ipk) \ < (logn)^a3^/2^-7/2 ^ ^ j g 

(v) 1(0, ^fc) I < (logn)=a2^n-'^/2 if k ^ j e [ro]. 

rW 1(0,0)I < (logn)'=a5^/2„-7/2 

Proposition 19 will follow from the local analysis done in Section 9. The next Proposition 
will be established in Section 10 using a matrix expansion together with norm bounds derived 
by combinatorial arguments parallel to the proof of Proposition 11 for the Erdos-Renyi graph. 

Proposition 20. For some c > 0, w.h.p. 

sup ||S^a:^|| < (logn)°a^/^. 

xG//-'-,||a:|| = l 

We now check that the two preceding propositions imply Theorem 4. We consider {(pi, • • • , (pr>) 
obtained by the Gram-Schmidt orthonormalization of {tpi, • • • , tpr). By Lemma 9 and Proposi¬ 
tion 19(iv), w.h.p. r' = r and for all fc G [r], 

\\Fk - Fk\\ = 0((logn)^a^^/^n“'^/^). (51) 
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Similarly, for k G [ro], we denote by C,k the orthogonal projection of C,k on the orthogonal 
of the vector space spanned by (pj, j G [ro], j ^ k and Q, j < k. We set (k = Cfc/IICfc||- From 
Proposition 19(v)-(vi), we find w.h.p. for k G [ro], 

IlCfc - ail = OiilognYa^^/^n-^/^). ( 52 ) 


We then set 


Since \\ifk-<Pk\\ 
all fc G [r]. 


ro 

do = J2 

k^l 

o(l), from Proposition 19(i)-(iii), we find by induction on A: € [r], w.h.p. for 
\\B^Vk\\=0{a^). 


Consequently, from Proposition 20, we have w.h.p. 


\\B%=0{a^). 


In particular, since Doipk = OkCk = B^f>k + ^kiCk - Ck), we get for k G [rp], 

||.B^(pfe - i:)o<^fc|| < ||■B^||||‘?fc - (^fcll + ||^^<,5fc - Do^^fcll + ailCfc - all 

= 0((logn)"a^^/2n-'^/"). 

We have = n'^''/2+°(i)-7/2_ Since 0 < k < 7/6, 7k/2 — 7/2 < k/2, we thus obtain, if 

Po is the orthogonal projection of Bq = span((^fe, k G [ro]), 

|lP^Po-Coll = 0 (a^/"). ( 53 ) 

We also set Di = B^Pi where Pi is the orthogonal projection of Hi = span((^fc, fc € H\[^o]) 
and C = B^ — Dq — Di. Arguing similarly, from Proposition 19 (iii), w.h.p. , for k G [r]\[ro], 

llPK^fcll = llP^fcll < WB^WWk - ^k\\ + WB^^kW = 0{{\ognra^/^). 


Hence 

||Pi|] = 0((logn)V/2). (54) 

Also, let y G with jjyjj = 1. We write y = x + Hq + hi with x G H-^, hi G Hi, 
ho £ Ho = span((^fc, fc G [ro]). We find 

||C2/|| = ||P'a. + (P'-Po)/ioll< sup |lP^x|| + ||P^Po-Poll. 

xeH-T ,\\x\\=i 

Hence, Proposition 20 and (53)-(54) imply that w.h.p. 

||C|]= 0 ((logn)V/ 2 ). 

We decompose B^ = Do + R with P = C + Pi, from what precedes w.h.p. 

IIPII =0((logn)V/2). 

We are now in position to apply Proposition 8. From (52), the statement of Proposition 
19(ii) also holds with Cfc replaced by Ck- It readily implies Theorem 4. 
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8 Controls on the growth of Poisson multi-type branching 
processes 

In this section we derive results for multi-type Galton-Watson branching processes with Poisson 
offspring that will be crucial for the local analysis of Section 9. We refer to Section 3.2 for the 
notation used below. 


8.1 Two theorems of Kesten and Stigum 

We consider a multi-type branching process where a particle of type j S [r] has a Poi(Mij) 
number of children with type i. We denote by Zt = (Zt(l), ... ,Zt(r)) the population at gener¬ 
ation t, where Zt{i) is the number of particles at generation t with type i. We denote by Tt the 
natural filtration associated to Zt- Following Kesten and Stigum [18, 17], we have the following 
statement. 

Theorem 21. For any k G [rg], 

Xk(t) = - {(j)kTZQ), 

is an J-t-martingale converging a.s. and in such that for some C > 0 and all t > 0, 
EXk{t) = 0 and E[Xl{t)\Zo] < C||Zo||i. 


Proof. We include the proof for later use. For 0 < s < t, we have 

t-i 

Zt - M^-^Zs = Y, - MZ,), 

U — S 

so that, as 


{(t>k,Zt) 

l^l 


{(fk, Zq 


E 


{4’k, {Zu+1 — MZu)) 


Fk 


- 1-1 


(55) 


It follows easily that (Xfc(t)) is an J't-martingale with mean 0. From Doob’s martingale con¬ 
vergence Theorem, the statement will follow if we prove that for some C > 0 and all integer 
t > 0, 

E[x2(t)|Zo] <C|iZo||i =C(l,Zo). 

To this end, we denote by Zs+i{i,j) the number of individuals of type i in the s-|-l-th generation 
which descend from a particle of type j in the s-th generation. Thus X)jG[r] Zs+i (ij) = Zs -i-i(*)- 
We then have 


E[||Z,+i-MZ,||2|Z,] 


E® 


(Z,+i(*) - E M.jZs{j)?\Zs 

ieW 


E E 


{Zs+i{i,j) 


M,,Zs{g)f |Z«(j) 


E! (J) 

(1,MZ,), 


( 56 ) 
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where in the penultimate equality we used the fact that the variance of a Poisson random 
variable equals its mean. It follows that 


E [|lZ«+i - I Zo] = (1 ,M^+^Zq}. 

The Perron-Frobenius Theorem implies that the matrix converges elementwise to 

as s —>■ oo. In our case, (f)i = \, if)i = and = 1. Consequently, for 

some C > 1, 

E [||Z«+i - MZ,||2| Zo] = (1+ o(I))(l,Zo)/if 1 < C(1 ,Zo)m?+^ 

Hence finally, 

mi{t)\zY\ 


Since Yk > the above series is convergent. □ 


= E 


s=0 


^ E 


s=0 


E[(./>fc,(Z,+i-MZ,))"|Zo] 

, 2(s+l) 

|^fc||iE[HZ,+i-MZ,||i|Zo] 
, 2 (^*+l) 


< C(l,Zo)^ 


t—1 / \ s+1 

Ml 


We also need to control the behavior of {(j)k, Zt) for k G H\[ro]. The next result is contained 
in Kesten and Stigum [17, Theorem 2.4]. 

Theorem 22. Assume Zq = x. For k G [r]\[ro] define 


Xk{t) = 


{<i>k,Zt) 

t/2 

{4>k,Zt) 


*/ kl < kl 
*/ kl = kl- 


Then Xk{t) converges weakly to a random variable Xk with finite positive variance. 

Note that Theorem 2.4 in [17] expresses Xk as a mixture of Gaussian variables. The nor¬ 
malization in the case Yt — ki comes from the fact that M is diagonalizable, and hence all its 
Jordan blocks are of size 1. 


8.2 Quantitative versions of the Kesten-Stigum Theorems 

We will also need probabilistic bounds on the growth of the total population at generation t 
defined as 

St = \\Zth = {f,,Zf). 

We observe that (11) implies that St itself is a Galton-Walton branching process with offspring 
distribution Poi(/ri). 

Lemma 23. Assume So = 1. There exist co,ci > 0 such that for all s > 0, 

P(Vfc >l,Sk< sp\) > 1 - Cie-"“C 
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Proof. For k > 1, we set 


ek=Hi''^^Vk and fk = Y[{l + £()- 


1=1 


It is straightforward to check that fk converges, hence there exist constants cq, ci >0 such that 


for all fc > 1 , 


Co < fk < Cl and Sk < ci. 


(57) 


Using Chernov bound, if Yi are i.i.d. Poi(/ii) variables then, for any integer i > 1 and positive 
real s > 1 , 


(58) 


^ 2=1 


where we have set 7(5) = slogs — s + 1. In particular, on the event {Sk < s/feMil G -^fc, we 
have 

P(S'fe+i > sfk+ifii^^\Tk) < = e-°o(fc+i)s, 

where we have used the existence of some 9 > 0 such that for x G [0, ci], one has 7(1 + 0 ;) > 9x^. 
Finally by our choice of Sk and (57), if s > max(l/cQ, 1/ci), 


P(3fc : Sk > scim5^+i) < < - 


e 0 


— e 0*^ 


Hence we deduce the statement of the lemma for some (suitably redefined) constants co,ci > 

0. □ 

A key ingredient in the subsequent analysis will be the following result, which bounds by 
how much the growth of processes s —>■ {(fk, Zf) deviates from a purely deterministic exponential 
growth. 

Theorem 24. Let /? > 0 and Zq = x G W be fixed. There exists C = C{x, j3) > 0 such that 
with probability at least 1 — n~^, for all k G [rp], all s,t > 0, with 0 < s < t, 

\{fik,Zs) - fir*{(t>k,Zt)\ < C{s + l)fil^\\ognf/^. 

and for all k G [r]\[ro], all t > 0, 

{{(fk, Zt)\ < C{t + l)^/o‘/^(logn)^/^. 

Finally, for all k G [r’]\[ro], all t>0, E\{(fk,Zt)f < C{t + l)^fi\. 

Proof. We start with classical tail bounds for Y = Poi(A). From (58) for s > 0, 

P(y-A > As) < 

with 7 (s) = s log s + 1 — s. Similarly, for s < 1 one has 

P(r-A<-As)<e-^'>'(i-'*), 
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where by convention j(x) = +oo for x < 0. Let S(x) := 7(1 — x) A 7(1 + x). Then for any s > 0, 

P(|y- A| > As) < 

In particular, for any i G [r], letting y := MZt, we have, if Zt ^ 0, 

p(|Zt+i(i) - yii)\ > s\\y\\l^^ I Tt) < 

1/2 

Consider first the case where s||y||i' < y{i). As there exists 9 > 0 such that for all x G [0,1], 
6{x) > 9x^, we get 


p(|Zt+i(z) - y(z)| > I Tt) < 26-"^ < 2e-^^\ 

1 /2 

Consider now the case where s||?/||^' > y{i). As there exists 9' > 0 such that, for all x > 1, 

6{x) > 9'x, we get 


F(\Zt+iii) - yii)\ > s\\y\\Y^ \ Ft) < 

since Zt ^ Q implies that ||y||i > yi from (11). Thus there exists some cq > 0 such that, for any 
s > 0, 


p(||z,+,-A(Z,||2>»||Z,||;''" 


^.)<Ep 


\Zt+iii) - y{i)\ > 


s\\Zt\\\'^ 





If Zt = 0, then Zt+i = 0 and the same bound trivially holds. We thus obtain the existence of 
constants co,ci >0 such that, for any it > 1, 

P(vt > 0, \\Zt+i - MZth < u{t + 1) \ogn\\Zt\\y^) > 1 - ^ > 1 - cin-"““. (59) 

t>l 


Now, from (55), for any 5, 0 < s < i. 


t-i 

^s) ~ Mfc {4*ki ^t) I ^ f^k ^ ^ 

h—s 


UkhWZh+i-MZhW 


2 


From Equation (59) and Lemma 23, for C large enough, with probability at least 1 — n~^, 
we have for all /i > 0 that \\Zh+i — MZh \\2 < C(logn)(h + l)||Z/i||y^ and ||^?t||i < C{\ogn)fj,i. 
On this event, we get, for k G [xq]. 


|(<^fe,Z,) < C'{\ognf/^^ll'£ih + l)(y^) < C''i\ognf/\s + l)yf, 

h=s \ 9-k J 


where at the last line, we used that /i^. > yi and J2h>s — c(a)sa® for 0 < a < 1. Similarly, 
on the same event, for k G [x]\[ro], from (55), for t > 1 and s = 0, 


l(A. z.) - Zo)| < ,4 E II^AbllZ.^ - AfZ„||, ^ c'dognf'V, + 1 )(^ 

11=0 «=0 

Using now yl < yi, we have E1=U“ + 1)(^) = y.kf). 
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For the last result, we define 


rr W^t+l - MZt\\2 

U = sup- 

From (59) (with n = 2), for any p > 1, Et/^ = 0(1). We obtain from (55) and Cauchy-Schwartz 
inequality 




< 


< 






E® 

i -1 

E® 

s =0 


UkWlWz.+^-MZ, 

, 2 (^<+l) 

Pfc 

u\s+inzA\i 

2(s+l) 

^^k 


s=0 


VWWi 

2(s+l) 

Mfc 


o(tVi), 


where for the last equality, we used the fact that EUZsIli = which follows from Theorem 

21 with fc = 1 (recall that (j)i = 1 ), and the bound of 0 (tp‘/p^‘) on the sum, which holds for 
k ^ [ro]. □ 


8.3 A cross-generation functional 

For the subsequent analysis, in order to control the law of the candidate eigenvectors B^B*^Xk, 
we also need to consider a functional of the multi-type branching process which depends on 
particles in more than one generation. More precisely, assuming that ||^o||i = 1; we denote 
by V the particles of the random tree and o G Id the starting particle. Particle v G V has 
type a{v) G [r] and generation |u| from o £ V. For v G V and integer t > 0, let denote 
the set of particles of generation t from v in the subtree of particles with common ancestor 
V GV. Finally, Z^ = (Z)'(l), • • • , Z^{r)) is the vector of population at generation t from v, i.e. 
ZHi) = 'Eu&y- = *)■ We set 

s^ = \\z^h = {</^,,z^). 

With our previous notation, Z° = Zt, S° = St- We fix an integer A: G [r], .^ > 1 and set 

Qk,e = E (j)k{cr{u2i+i)), (60) 

('Uo,...,ll2f + l)e'P2f + l 

where the sum is over (uq, ... ,U 2 e+i) G P 2 £+i, the set of paths in the tree tree starting from 
Ug = o of length 2 ^ -1- 1 with (ug,..., u^) and (u £,..., U 2 £+i) non-backtracking and ug-i = 

(i.e. (ug,..., U 2 £+i) backtracks exactly once at the £ + 1 -th step). 

The following alternative representation of Qk,£ will prove useful. By distinguishing paths 
(ug,..., U 2 ^+i) according to the smallest depth t G {0, ...,£— 1} to which they climb back after 
visiting and the node U 2 £-t they then visit at level t we have that 

£-1 

= E E (61) 

£—0 u^Y° 
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where we let for \u\ = t > 0, 


wGY^ 

We then have 

Theorem 25. Assume Zq = 5x- For k G [ro], Qk,i./kY converges in Li^ as I tends to infinity 
to a random variable with mean ^k4>k{x)/{ifj./a — 1). For k G [r]\[ro], there exists a constant 
C such that . 

Proof of Theorem 25. Let Ft be the filtration generated by (Zg,..., Zt). The variables (L^ i^uG 
Yf) are independent given Ft- We will show that the sum (61) concentrates around its mean. 
Let us first compute the mean of for u G Yf. We use the fact, that given Ft+i and 
V w G Yffi, Zf and are independent. Hence we have with the short-hand notation 

{v,w)^Y.^ ,v^w 

By assumption (11), Moreover, we have Ejr^_^_^{cj)k, Zf) = yt\{(j)k,ZQ) so 

that 



E^,L“, = a'-‘-VlE^, (|r“| - 1) E 

= a^-‘-VlE^.((|l^i“l - l)in“l) E 

a 

iG[r] 

= ( 62 ) 

and 

E (63) 

u&Y- 

where Yk{t) = Xkft) + {4>k, Zf) and Xk is the centered martingale defined in Theorem 21. 

We now prove the statements of the theorem for k G [rg]. We find similarly 

Varjr^ (L^^^) = 

< CE,(^k,Zt)^E,Sl,_^, 

where E*(-) = max^gj,,] E(-|Zg = di) and constant C can be taken equal to E*|Y]°|"^. For k G [rg], 
we deduce from Theorem 21, that for some new C > 0, 


Var^, 



= E Var^,(L^_,) < CyP^a^P-^'^St. 

ueYf 


(64) 
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We now define 


Qk,, = E E = E (65) 

t—0 u^Y° t—0 

Since k G [rp], pk '■= p'i/o. > 1. We write 

Qk,l t-tv 

21 —k‘k2_^Pk yk{t)- 

f^k t=0 

From Theorem 21, Qk,t/p^ converges a.s. to pkXki'X))/{pk — 1) where Yfc(oo) = Xk{oo)+{(j)k, Zq) 
and Xfc(oo) is the limit of the martingale defined in Theorem 21. Moreover, Qk,t/p\^ also 
converges in Y. Indeed, we find easily from Cauchy-Schwartz inequality, 

\Pk Pk k / y 

< pI fE/^H (^2^iyk{t) - n(oo))VE 

\t=0 / \t=0 

Since pk > 1, the first term of the above expression is of order 0(1). For the second term, from 
Theorem 21, for any e > 0, there is to such that for all t > to, E{Yk{t) — Yk{oo))'^ < e. We find 
that the second term is 0 (e + p*k~^) = o(l)- It proves that Qk,e/Pk^ converges in Y. 

We now check that Qk,e and Qk,i are close in Y for k G [ro]. For a real random variable Z, 
set II^11 2 = VEZ^. From (61)-(64) and the triangle inequality, we get 

\\Qk,i — Qk,e\\2 < 




E 

E li,-e^, e 

t^o 


i-l 

/ / \ \ 

E 

Var^, E 

t^o 

\ / / 


oEM‘fc«^-‘llv^ll2 = 0{pW/^) = 0{pf), 

t =0 


where at the last line, we have used that Lemma 23 and k G [rg]. It follows that ||(Qfe,^ — 
Qk,i)/pjf \\2 goes to 0 and it concludes the statements of the theorem for k G [rg]. 

For k ^ [rg], we note that EZ"^ < E(EyZ)^ + EVary(Z) so that ||2’||2 < ||EyZ ’||2 + 
||Vary(Z')^/^|| 2 . From (61) and the triangle inequality, we get 


t-i 


WQkAU < E 


t=0 

1-1 


^ E 


t =0 


E ^kA 

uevp 


I E 


k,e 


The last statement of Theorem 24 and (64) give 





( 66 ) 


Var^^ 
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We deduce from (66) and (63), 


i-i 


\\QkA\2 <CY. [Ak<^^-A{4>k,Zt)h + = 0{aH^I\ 


t=o 


It concludes the proof. □ 

We finish this section with a rough bound on Qk,t- 
Lemma 26. For any p>l, there exists a constant C = C{p, a) > 0 such that for any k G [r], 

E\Qk,i\^ < 

Proof. We use the notation of Theorem 25. First, from Lemma 23, for any p > 1, < Ca*^. 

In particular, for any v S Yt+i and fc € [r], 

= 0(E^,+,(5r)") = 0 (a‘^). 

We use twice the bound | YAi=i J27=i AiY’■ We find 






udYF 




uGYF 


< sr^ Y 


5] {cfk,zf)sr.t-i 

uGY° {v,w)GY^ ,v^w 

< Sf-^ Y E.F*(5'r)^'^”^^ Y Ca*Pa^^-*-^'>P 

U^YP (v 

< Y Ca'^^-^^PW.^FASif^ 

uGYp 

< C'a^PpP, 

for some new constant C depending on a and p. We deduce that for some new C > 0, 


E 




uGYF 


< CPa^‘^+*'>P. 


For a real random variable Z, set \\Z\\p = (EZP)^/^. We use (61) and the triangle inequality, we 
get 


t-i 


||Q/c,.^||p 5: "y ^ 


t=0 


E 


uGYF 


i-1 


< =Oia^A- 


t=o 


□ 


8.4 Decorrelation in homogeneous Galton-Watson branching processes 

We now establish that the variables Qk,e and are uncorrelated when k ^ j. To this end we 
need the following lemma. 
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Lemma 27. Assume that the spin cr(o) at the root node o is distributed according to the sta¬ 
tionary distribution tt. Conditionally on the branching tree T, the process of spins cr(u) attached 
to the vertices of the tree is a Markov random field. For any two neighbor nodes u,v of'T and 
any i,j € [r], one has the following transition probabilities 

P(cr(u) = i\a{v) = j,T) = ^Mij. 

For any two (possibly equal) nodes u,v ofT, any i,j € [r], i ^ j, one has 

E{(l}kicr{u))(l)j{a{v))\T) = 0. (67) 

Proof. By standard properties of independent Poisson random variables, conditionally on the 
spin (t(o) and on the number of children of the root o, then the spins of each of the children of 
the root are i.i.d., distributed according to M.o.(o)/Qf. Moreover, tt is the stationary distribution 
for this transition kernel, which is reversible, as follows from the relation M = IIW and the 
facts that W is symmetric together with the assumption (11) that the column sums of M all 
coincide with a. The Markov random field property and the expression of the transition kernel 
follow by iterating this argument. 

We now evaluate the conditional expectation in (67). Let ui = u,... ,Ut = v denote the 
unique path in T connecting nodes u and v. Let Fg denote the u-field generated by T and the 
spin variables cr(Mi),..., (j(us). We then have by the Markov random field property 

E((/>j(cr(us+i))jFs) = V -M,^(„^)(^j(i) = ^(j>j{a{us)), 

iG[r] 

where we used the fact that (fj is a left-eigenvector of M associated with eigenvalue p,j. Thus 

icr{v))\T) = E{(fki(7{u))(l)j{a{u))\T) 

= 0 , 

where the last equality follows from 7 r-orthogonality (9) between vectors 4>k and (fj for j k. □ 
We now show the following 

Theorem 28. Let j k £ [r] and Zq = 6^ where b has distribution ( 7 r(l),... , 7 r(r)). Then for 
any i > 0, 

EQk,eQj,i = 0. 

Proof. Write Qk,i as 

Qk,i= ^ (j)k{<j{w)), 

{v,w)^V{T) 

where the sum extends over a set 'P(T) of node pairs {v,w) that depends only on the tree T. 
Using the analogue expression for Qj i one obtains 

E{Qk,iQj/\F) = X((ii,'!u)g'P(T) X((jj',tu')GP(T) ^('Pk{o'{w))(pj{a{w'))\F) 

= 0 

by Lemma 27, Equation(67). This concludes the proof. □ 
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9 Local structure of random graphs 


We now derive the necessary controls on the local structure of the SBM random graphs under 
consideration. Coupling results will allow to bound the deviation of their local structure from 
branching processes. Asymptotic independence between local neighborhoods of distinct nodes 
will then be used to establish weak laws of large numbers. 


9.1 Coupling 

For e, / G E{V), we define the ’’oriented” distance 

d(e,f) = min £( 7 ) 
7 


where the minimum is taken over all self-avoiding paths 7 = ( 70 , 71 , • • • ,7£+i) in G such that 
( 70 , 71 ) = e, ( 7 ^, 7 ^-i-i) = / and for all 1 < /c < £ -b 1 , {'^k,lk+i\ G E (we do not require that 
e G E). Observe that d is not symmetric, we have instead d{e,f) = d{f~^,e~^). 

Then, for integer £ > 0, we introduce the vector Yt[e) = (l^t(e)(i))ig[r] where, for i G [r], 


lt(e)(0 = ||/ G A : d(e,/) = t,a{f 2 ) = *}|- 


( 68 ) 


We also set 


St{e) = \\Yt{e)h = \{fGE:d{e,f)=t]\. 


The vector Yt{e) counts the types at oriented distance t from e. 

We shall denote by St{v) the set of vertices at distance t from v. We introduce 


n{i) = ^l(cr(u) =i), 

V — 1 


«n(*) = 

i=i 


7r„(z) 


n(0 

n 


an = maxa„(z) = a + 0(n '*), 

ie[r] 


(69) 


where at the last line we have used Assumption (11)-(13). Central to our local study is the 
classical exploration process of the neighborhood of v which starts with Ag = {u} and at stage 
£ > 0 , if At is not empty, takes a vertex in At at minimal distance from v, say Vt, reveals its 
neighbors, say JVt+i, in [n]\At, and update At+i = {At U Nt+i)\{vt}. We will denote by Et 
the filtration generated by (Ag, • • • , At) and by Dt = Uo<s< 4 Ag the set of discovered vertices at 
time £. We start by establishing a rough bound on the growth of St- 

Lemma 29. There exist cg, ci >0 such that for all s > 0 and for any w G [n] U E{V), 


P(V£ > 0 : St{w) < sal^) > 1 — Cie 


Consequently, for any p >1, there exists c > 0 such that 

E max < c(logn)^. 

\ al, J 
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Proof. Recall that = p a:P“^P(|X| > t)dt and P(max„ Xt, > t) < 1 A — 0 - 

Then the second statement is a direct consequence of the first statement. 

To prove the first statement, observe that, in the exploration process, given Pf, if Vt has type 
j, the number of neighbors of Vt in [n\\Dt is upper bounded stochastically by 

r 

i=l 


where Vij = Bin{n{i),Wij/n) = Bin(n(i),7r„(i)Wij/n(i)) are independent. In particular, for 
any 0 > 0, using 1 +1 < e*, we find 


Ee 


dVi 


=nE 1- 


T^n{i)Wij TTn{i)Wij 


i{i) 


i{i) 




n{i) 


< g-«n(i)+a,xO)e® ^ g-Q„+a„e'’ 


For any j € [r], we have thus bound for 0 > 0 the characteristic function of Vj by the 
characteristic function of a Poi(Q;„) variable. It remains finally to repeat the proof of Lemma 
23 from (58) with pi replaced by q:„. □ 


We now check that the random graph G is locally tree-like. For v G [n] and integer h > 0, 
we denote by (G, v)h the rooted subgraph of G rooted at v, spanned by the vertices at distance 
at most h from v. If e = {u,v) G E{V), we set {G,e)h = {G',v)h where G' is the graph G with 
the edge {u, u} removed (if it was present in G). 

Lemma 30. Let I ^ Klog^n with k < 1/2. Then, w.h.p. the random graph G is i-tangle-free 
and w.h.p. there are less that d^logn vertices whose (.-neighborhood contains a cycle. 


Proof. We start by proving the second statement. In the exploration process all vertices in 
V (G, v)e have been revealed at time r. Let r be defined as the first time t at which all nodes 
at distance £ or less from v have been discovered. It is clearly a stopping time for the filtra¬ 
tion Pf. By construction, given P^., the set of discovered edges in V{G,v)g builds a spanning 
tree of V{G,v)i. Also, given P^, the number of undiscovered edges between two vertices in 
V{G,v)i is stochastically upper bounded by Bin(m,a/n) where m = \V{G,v)i\ = Si{v) and 
a = maxij W{i,j). It follows from Lemma 29 that, for some c > 0, 

, oESeiv) ca^ 

P((G,u)^ IS not a tree) < - < -. (70) 

Hence from Markov inequality. 


PI E 1((G, v)i is not a tree) > log n 

\ V 

The second statement follows. 

We now turn to the first statement. First recall that the probability that Bin)™, q) is not 
in {0,1} is at most q^m{m — 1) < q^m?. Also, if G is Gtangled, then there exists v G \n\ such 
that V (G, v)i has at least two undiscovered edges. In particular, from the union bound, 



P(G is Gtangled) < 


E 




< 




o(I). 


where c > 0 and we have used again Lemma 29. 


□ 


40 








We conclude this subsection with a coupling of the process Zt{e) and a multi-type Galton- 
Watson tree. Recall that for probability measures P, Q on a countable set X, the total variation 
distance is given by 

clTv(i", Q) = ^ E = minP(X ^ P), 

xGA' 

where the minimum is over all coupling {X,Y) such that X ^ P, Y Q. 

Proposition 31. Let £ ~ Klog^n with 0 < k < 1/2 and e = {u,v) G E(V). Let {T,o) be 
the random rooted tree associated to the Galton-Watson branching process defined in Section 8 
started from Zq = Sa-(v) ■ The total variation distance between the law of (G, e)i and (T, o)e goes 
to 0 as 0((logn)a^n“'>'^(^“”^). The same holds with {G,e)e replaced by {G,v)e. 

Proof. We prove the first statement, the proof of the second statement is identical (see comment 
below (71)). If e = {u,v) and G' = G\{m, n}, we consider the filtration Pt associated to the 
exploration process of {G',v). We let r be the stopping time where all vertices of {G,e)i have 
been revealed. We set yo = Sy and at step t > 0, we denote by yt+i = (yt+i(l),..., yt+i{r)) the 
number of discovered neighbors of Vt in [n]\At of each type. If a^Vt) = j then, given Pt, the 
variables (yt+i{i))i^[r] are independent and yt+i(i) has distribution Bin(n((i), Wy/n) where 

t 

nt{i) = n{i) -^ys{i) - l(t = 0,a(u) = i), (71) 

(the last term comes from the difference between G and G'; this term is not present in the 
case of the second statement on {G,v)i). We perform the same exploration process on {T,o), 
that is a breath-first search of the tree, we discover at each step the offsprings say Xt+i = 
(a;t+i(l),... ,a:t+i(r)) of the active vertex vt- In particular, if vt has type j then the variables 
(a;t+i(z))ig[r] are conditionally independent and Xt+i{i) has distribution Poi(7r(i)Wij). To couple 
the two processes, we shall use the following classical bounds, (see e.g. [2]), 

dxv 1 Bin( m, — ), Poi(A) I < — and dTv(Poi(A), Poi(A')) < |A — A'l. (72) 
\ \ m J J m 

For 0 < t < T, define the event Lit = {\Dt \ < ca^ logn} G Pt, where Dt is the set of discovered 
vertices. By Lemma 29, for c large enough, r < ca^ log n and Ltr holds with probability larger 
than 1 — 1/n. Also, by (70), with probability at least 1 — ca^/n, (G, e)^ is a tree. It follows that 
by iteration, it is enough to check that, if Lit holds, there exists G > 0 such that 

dTv{Pt+i,Qt+i) < (73) 

where Pt+i is the distribution of yt+i under P(-|J't) and Qt+i has the law of Xt+i = (xi, • • • , Xr) 
where Xi are independent with distribution Poi{TT{i)Wij), with j = (j{vt). However from (72) 
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and the triangle inequality, we have 


dTv(-Pt+i, Qt+i) < dxv ^t+i,(X)P™ 


/ nt{i)Wij 


n 


dTvl (g)Poi(^^^^&),Q,+i 




nS) 


-7r{i) 


< 




2=1 


From (13), the latter is 0{n ”)). We thus have proved that (73) holds for some new 

C7 > 0. □ 


We will use the following corollary of Proposition 31. 

Corollary 32. Let I ~ ^log^ n with 0 < k < 7 A 1/2. For e G E{V), we define the event £{e) 
that for all 0 < t < £ and k G [r].- | (c/fc, Yt(e)) — Y^(e))| < (logn)"^a‘/^, if k € [rg], and 

\{(j)k,Yt{e))\ < (logn)^a‘/2, if k G [r]\[ro]. 

Then, w.h.p. the number of edges e G E such that£{e) does not hold is at most (log 

Proof. First, with exponentially large probability there are less that 2Q;n edges in E. From the 
union bound, it is thus enough to prove that for any e G E{V), f‘{£{eY) < C{\ogn)a^n~'^. To 
show this, use the coupling result of Proposition 31 to deduce that with probability at least 
1 — C'(logn)a^n“^, the processes {Yt{e))o<t<e and (^t)o<t<^ coincide. It then remains to use 
Theorem 24 with (3 = 1. □ 


9.2 Geometric growth of linear functions of non-backtracking walks 

For k G [r], we recall that 

Xfe(e) = </fe(cr(e2)). 

The next proposition asserts that for most e G E, {B^XkYe) grows nearly geometrically in t 
with rate fik up to an error of order 

Proposition 33. Let I ^ Klog„ n with 0 < k < 7 A 1/2. There exists a random subset of edges 
El G E such that w.h.p. the following holds 

(i) for all e G E\Ei, 0 < t < £, 

\{B*Xk,Se) - Fl~^{B^Xk,Se)\ < (logn)V/^ if k G [ro], 

\{B*Xk,Se)\ < (logn)V/^ */fc G [r]\[ro], 

(ii) for all e G Ei, 0 < t < i and k G [r], 

\{B*Xk,Se)\ < {lognfa*, 

(Hi) \Ei\ < {lognYa^n^~^. 
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Proof. We define Ee as the set of oriented edges such that either (G, € 2)1 is not a tree or the event 
£{e) defined in Corollary 32 does not holds. Then by Lemma 30 and Corollary 32, w.h.p. E( 
satisfies condition (iii). Moreover, by definition if (G, 62 )^ is a tree 

{B*Xk,6e) = {f>k,Ytie)). ( 74 ) 

and statement (i) follows from Corollary 32. For statement (ii), we simply use that w.h.p. G 
is tangle free (by Lemma 30) , hence, there are at most two non-backtracking walks of length t 
from e to any /. We get 

{B*Xk,Se) < 2||0fc||ooS't(e). 

However, by Lemma 29 w.h.p. for alH > 0 and all e € E, 154 ( 6)1 < G(logn)a‘. □ 

Corollary 34. Let £ ~ ^^loga n with 0 < k < 7 / 2 . With high probability, for any 0 < t < £ — 1 
and k G [r], 

sup \{B^Xk,x) \ < (logn)®n^/^a‘/^. 

{B^Xk,x)=0,\\x\\ = l 

Proof. We write 

{B*Xk, x) = Xe{B*Xk, 5e) + Y ^e{B*Xk, Se) = I + J. 

e£Ei e^Ei 

From Cauchy-Schwartz inequality, the first term is bounded w.h.p. by 

l-^l < (logn)^a‘ Y^ \xe\ < (log n)'^a*'\J\Ei\ < (logn)'^a*a^/^n^^“'^^/^ = 

ef^Ei 

where we have used that and « < 7 / 2 . For the second term, if 

k G [ro], using {B^Xk^x) = 0 , we get similarly w.h.p. 

\J\ < Y \^e\\{B‘^Xk,Se)\+ Y \^e\\{B*Xk,Se) - tJ-i~^{B‘^Xk,Se)\ 

G G El G ^ El 

< (logn)^n^^^a‘^^a^n“^'^^ -I- G(logn)‘*n^^^a‘^^. 

Finally, if fc e [r]\[ro], we simply write w.h.p. \J\ < J2etEi \xe\\{B*Xk,Se)\ < {\ogn)^n^/‘^aB^. 

□ 


9.3 Laws of large numbers for local functions 

We first prove weak laws of large numbers for general local functionals of SBM random graphs 
that will then be applied to specihc functionals of interest. 

9.3.1 Weak laws of large numbers for local functionals: convergence speed 

We start with a general variance bound for local functions of an inhomogeneous random graph. 
A colored graph is a graph G = {V,E) with a map a : V -G [r]. We denote by G* the set of 
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rooted colored graphs, i.e. the set of pairs (G, o) formed by a colored graph G and a distinguished 
vertex o & V. We shall say that a function t from Q* to K is Glocal, if t(G, o) is only function 
of {G,o)i. 


Proposition 35. There exists c > 0 such that if T,Lp ■. Q* ^ M. are i-local, |r(G, o)| < (p{G,o) 
and if is non-decreasing by the addition of edges, then 



Proof. We first bound the expectation of 


n 




u—1 


where S(G, u) is defined as the number of vertices at distance £ from u € V. By Lemma 29, for 
any u € [n]. 



Hence, for any p > 1, for some Cp > 0, 


n 


EZP < nP-^E Y, S2P(G, u) < CpuPo^^P. 


U—1 


Now, for 1 < /c < n, let Xk = {1 < v < k : {v,k} € B}, where E is the edge set of G. The 
vector (Xi, • • • , X„) is an independent vector and for some function B, 


n 


y :=.^t(G,«)=P(Xi,..- ,x„). 


We also define G^ as the graph with edge set Uy^k^v We set 


n 


=Y 'P(^k,u)- 


U—1 


Since r is Glocal, we observe that r(G, u) — T(Gk,u) can be non zero only if u S H((G, k)i) and 
it is bounded by A = 2max„g[„] (p{G,u). Consequently, 


n 


n 




Finally, we conclude by using Efron-Stein’s inequality: Var(y) < EZAf < \/EZ'^\/EK'^. □ 

We now apply the above proposition and Proposition 31 to show that the SBM random 
graph with uniform root selection converges weakly to the multi-type Galton-Watson process 
previously studied. The established convergence implies convergence for the local weak topology 
of Benjamin! and Schramm (see [3]). Crucially we are able to consider local functions with 
logarithmic distance parameter £ and obtain bounds on the convergence speed. 
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Proposition 36. Let I ~ nlog^n with 0 < k < 1/2. There exists c > 0, sueh that if T,(p : 

M are (.-local, |t(G, o)| < (p(G, o) and is non-deereasing by the addition of edges, then if 
Ei^(r, o) is finite. 


E 


1 

-^r(G,w) -Er(T, o) 
n 


< 


OL 




^7/2 


E max p^{G, v) 


1/4 


y{^p\T,o)) 


1/2 


where (T, o) is the random rooted tree associated to the Galton- Watson branching process defined 
in Section 8 started from Zq = (5^ and t has distribution ( 7 r(l),..., 7 r(r)). 


Proof. In view of Proposition 35 and Jensen’s inequality, it is sufficient to prove that 
1 " 

- VET(G,i;)-Er(T,o) 

n 

v—\ 

with /3 = (7 A (1 — k))/ 2. For i S [r], we set Ti = T{Ti,o) where {Ti,o) has distribution the 
the random tree {T,o) tarted from Zq = 6i. Let v G V with a(v) = i. We denote by Xv the 
indicator function that the coupling of {G,v)i and (Ti,o)i described in Proposition 31 is not 
successful. We have, from Cauchy-Schwartz inequality. 


j,G2 ^_/ \ 1/2N 

= 0| — 7 /logn( maxEi /5 (G,w)VE(/) {T,o) 

\vGln] 


(75) 


|Er(G, ?;) - Er/ = |Ex,;r(G, u) - Ex.„ri| 

< 2^{Exv){Ep{G,v)^VEp{T,,o)^) 

= o(^a‘^/‘^n-l^^/\ogn^/Eip{G, u)2 V Eip{T„ 0 ) 2 ) . 

Let vi,...Vr be fixed vertices such that (7{vi) = i (since 7r(/) > 0 such Vi exists for n large 
enough). We recall that Et(G,z;) depends only on (t{v). Hence, using (13) 

E-^r(G,z;) = ^!^Er(G,u.) 

vGV i—l 

r 

= ^ |(7’'(i) + 0(n“'^))ETi + ^J\ogn^Eip{G,ViY V Ep(Ti,oY^ | 

= Et{T, o) + 0^\ogn^/E<p{G, Vi^ V E/9(T,, 0 ) 2 ) . 

It concludes the proof of (75). □ 


9.3.2 Law of large numbers for specific local functions 

We will now apply Proposition 36 to deduce weak laws of large numbers for expressions closely 
related to {B^Xk,B^Xj)j {B'^^Xk, B^Xj) and {B^B*^Xk, B^B*^Xj). Recall the definition of 17(6) 
in ( 68 ). 

Proposition 37. Let I ^ Klog^, n with 0 < k < 7 / 4 . 

(i) For any k G [ro], there exists pk > 0 such that, in probability, 

1 {4’k^Yi{e)Y ^ 

— fM >Pk- 
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(ii) For any k G H\[»"o]j there exists pk > 0 such that w.h.p. 


1 (0fc; ^(e))^ 

an a^ 

eGE 


> Pk- 


(in) For any k j G [r], 


E 


rvn 


e£E 




(iv) For any k ^ j G [r], 


E 


— '^{((k,Y2eie)){(l)j,Ye{e)) 


eGE 


= o{^^l'^n '’'/^(logn 


(v) For any k G [rp], in probability 


1 (0fc) Y2e{e)) {(f)k^Yi(e)) 


Proof. Let Zt, t > 0, he the Galton-Watson branching process defined in Section 8 started from 
Zq = 6i, and i has distribution (7 r(l),..., 7r(r)). We denote by (T, o) the associated random 
rooted tree. If fc G [ro], by Theorem 21, for some pk > 0, 

]E(<^fe, = Pk + o(l). 

We set t{G,v) = J2eGE-.e2=vi^k,Yi{e))^ . We observe that 

^ {((k,Yi{e)Y < ^ S‘i{e) ^Sj+^iv). 

e^E:e2—v e^E:e2—v 

We get, using < a~^, that t{G,v) < p{G,v) := Also, Lemma 29 implies that 

Emaxt, p{G,v)‘^ = 0((log The same upper bound holds for (p{T,o) by Lemma 23. We 

deduce from Proposition 36 that 


E 


1 y^ {4’k,Yi(e))‘^ 
an 

e^E 


E{cfk,Ziy 

op 


= 0(a3^/2 (log n)5/2n-^/2). 


It proves statement (i) of the proposition. 

For statement (ii), we use Theorem 22 instead. We denote by Xk the limit martingale in 
Theorem 22 when Zq = S^. If pi < pi, we find similarly that for any 0 > 0, 


1 

an 


f {4>k,Yi{e)) 

eGE 


2 

A6» 


converges in to E(|Afcp A 9). From Theorem 22, the latter is positive if 9 is large enough. In 
the case pi = pi, by Theorem 22, we need to normalize by 


46 













For statement (iii), we use Lemma 27. We set t{G,v) := J2eeE-e2=v(^k,Yi{e)){4‘j,ye{e)). As 
above, we have t{G,v) < (p{G,v) = and Emax^ (/^(G, = 0((logn)®a®^) by Lemma 

29. The same upper bound holds for (p{T, o) from Lemma 23. It remains to apply Proposition 
36. 

For statement (iv), we have E{(j)k, Z 2 e){(l>jT Ze) = Zi){(j}j, Z^) = 0 by Lemma 27. We 

set t{G,v) = J2eGE-e2^vi^k,Y2iie)){(l}j,Yi{e)). We have 

e{G,v) < ip{G,v) = ^ S 2 i{e)Se{e) < S 2 i+i{v)Si+i{v). 

eGE:e2—v 

Moreover, Euiaxy ip{G,v)‘^ = 0((logn)®a^^^) by Lemma 29 and Cauchy-Schwarz inequality. 
The same upper bound holds for ip(T, o) from Lemma 23. It remains to apply Proposition 36. 

Finally, for statement (v), E{4>k, Z2t){4>k, Ze) = /i|,E((/)fe, = nl^{pk + o(l)). We use 

Pl\> a and then repeat the proof of statement (iv), we get 


E 


1 F2^(e))(</'fc) W(e)) 

rvn 


bGE 




= 0(a"^/2-3£/2^-7/2(l0g, 


Since k < y/d, the right hand side is o(l). 


□ 


We conclude this subsection by estimates on quantities which are closely related to B^B*^Xk- 
For e G E{V), we define for t > 0, A’t(e) = {f G E : d{e, /) = t}. For k G [r], we set 

i-i 

PkAe)=Y. E (76) 

‘=0 feytie) 

where 

Lkif) = Ak,Yt{9))Si-t-i{h), 

(g,h)^yiif)\yt{e)-,g^h 

and Yt{g), Si-t-i{h) = ||b^_t_i(/i)||i are the variables Yt{g), Si-t-i{E) defined on the graph 
G where all edges in (G, 62)4 have been removed. In particular, if {G,e) 2 i is a tree, Ys{g) and 
Ys{g) coincide for s <2i — t. 

We also define 

SkAA = Se{e)4>kia{ei)). (77) 

As can be seen from (60), when (G, 62 ) 2 ^ is a tree, it follows that 

B^B*^Xk{e) = PkAA + SkAA- (78) 


This is the reason why controls of quantities Pk,t and S'fcy, themselves based on the previous 
branching analysis of quantities Qk,t and Si, respectively, will be instrumental in our analysis of 
vectors B^B*^Xk- We have the following 

Proposition 38. Let i ^ Klog^, n with 0 < k < 7 / 5 . 

(i) For any k G [rp], there exists A > 0 such that w.h.p. 


1 PkAA 

an Z-^ uA 

edE ^ 


p'k- 
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(ii) For any k G H\[»"o]j there exists Ck > 0 such that w.h.p. 

<Ck. 


1 Pk,ei^) 

an a^^(logn) 

eG-E 


(in) For any k ^ j € [r], for some c > 0, 
1 


E 




eGE 




Proof. Let t > 0, be the Galton-Watson branching process defined in Section 8 started from 
Zq = and l has distribution (7r(l),..., 7r(r)). We denote by (T, o) the associated random 
rooted tree. For any k G [ro], by Theorem 25, for some positive constant 

nQli 


2t — P/c+o(l)- 


On the other hand, Theorem 25 also ensures that for some Ck > 0, for k G [»’]\[7’o]) 


E[Ql 


a^^ilog n)' 


< 2Cfc. 


(79) 


(80) 


Let us denote here by Ft the u-algebra spanned by (G, e)t, given Ft+i and g G (G, e)t+i. By 
a monotonicity argument, the statement of Lemma 29 applies to variables (S's(p), s > 0). Thus 
for any p > 1, there is a constant c > 0 such that for any integer s > 0 , E[,Ss(( 7 )p| 

We thus have, repeating the proof of Lemma 26, that for any fixed p > 1, 

EPkAef < = 0(a2«P). 

Then the argument in the proof of Proposition 37 can be applied. For statement (i), we let 
k G [ro] and define r(G, r) = ^ 


eGE,e 2 =v ^k,ei^)k‘k ' 

M(v) = max max max (Ss(u)/a^). 

0<t<lue{G,v)t s<2i-t 


Since ytf. > a, we have the rough bound 


i-i 


r(G,r) < a H 

eGE,e2—v 0 /G!Vt(e) 




= {M^{v)af Y X X 1 

e£E,e2—v y*—0 /^3^tCe) 

Hence t(G, v) < CM{v)^a^^ = p(G, v). However, we have by Lemma 29 that E max„ p(G, = 
0((logn)2‘^Qf®^). By Lemma 23, the same bound holds for (p{T,o). We deduce from Proposition 
36 that 


E 


1 ^Qk,e 

an ^ gf pf 

eGE 


= o(a5^/2„-7/2(iog„). 


Since k < 7 / 5 , the right hand side goes to 0 and statement (i) follows from (79). Statement (ii) 
is proved similarly using (80) . For statement (iii) , we use the above computation, together with 
Theorem 28 and (9). It gives the claimed bound. □ 
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9.4 Proof of Proposition 19 

For the next lemma, recall the definitions (76)-(77) of the vectors Pk^e and Sk,i in We also 
introduce the vector in for fc € [r], 

Nk/{e) = {(l)k,Yi{e)). 

Lemma 39. Leti ^ nlog^n withO < k < ^/\l/2.Then, w.h.p. \\B^Xk—^k,i\\ = 0((logn)®/^a^^/^) 
\\B‘^B*‘^Xk - Pk,e - Sk,i\\ = 0((logn)^a3^) and \\B^B*^Xk - PkA = 

Proof. Let be as in Proposition 33 and let Eg C Ei be the subset of edges such that (G, 62 
is a not tree. From (74), Lemma 29 and Proposition 33 we have w.h.p. 

\\Nk,e-BAkf = Y.{Ak:Yi{e))-{BAkAe)f 

eGS; 

< 2Y,{St{ef + {BAkAe?) 

e^E'^ 

= o{\E'f\\og{nfA^^ =0{{\ogn)^A^), 

where at the last line, we have used Lemma 30. Since k < 1/2, it proves the first statement. 

Similarly for the second statement, recall as stated in (78) that when (G, 62 ) 2 ^ is a tree, then 
B^B*^Xk{e) = Pk,i{e) + Sk/{e). Let c £21 be the subset of edges such that (G, 62 ) 2 ^ is 
not a tree. If G is 2£-tangle free then there are at most two different non-backtracking paths 
between two edges. Hence, if e S E'^g, 

\B^B*Akie)\ < 2Uk\\.o{PiAe) + Se{e)) < 2(Pi,,(e) + Se(e)). 

Now, by Lemma 29, w.h.p. Se{e) < C{\ogn)A. Moreover, if M = max„_(<^ S'((w)/a* < Glogn, 
Pi/A) < Et=o I]/Gyt(e) So finally, w.h.p. for all e e E^ 

\B^B*Ak{e)\ = Oi{lognfA^). 


Hence, by Lemma 30, w.h.p. , 

\\B^B*Ak - Pk,e - = Oi^/\i^\(\ogn)A’^^) = 0((logn)V^). 

On the other hand, from Proposition 37(i), w.h.p. 

Il^fc.tll = 0{yAA). (81) 

The conclusion follows since k < 1/2. □ 

All ingredients are finally gathered to prove Proposition 19. 

Proof of Proposition 19. We use the notation of Lemma 39. 

Proof of (i). Let k G [rg]. By definition 

. \\B^B*Ak\\ 

WBAkW • 
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From Proposition 37(i) and Proposition 38(i) respectively, for some positive constants co,ci, 

w.h.p. 


CO < 


II I 


< 2ci and 


2 “ 


< 2ci. 


2 

It remains to use Lemma 39 and the assumption /i| > a. We find w.h.p. 

\B^Xk\ 


Co < 




< Cl and 




(82) 


Proof of (iii). Let k G [c]\[ro]. From Proposition 37(ii) w.h.p. ||iVfe_f|| > (co/2)-yna^/^ and 
from Proposition 38(ii) w.h.p. ||Pfe,^|| = Using Lemma 39, we find w.h.p. 


Co < 


l|i?^Xfc|| 

v^a^/2 


and 


\\B^B*^Xk\\ 
^/n{\ogny/‘^a^ ~ ^ 


(83) 


Proof of (ii). Let k G [ro]. Since Px = a; is an isometry, 

{B^B*^Xk,B*^Xk) ^ {B^Xk.B^^Xk) 

^^k,Pk) \\B^B*^Xk\\\\B^Xk\\ 

In view of (82), it is sufficient to prove that for some co > 0, w.h.p. {B^Xk, B'^^Xk) > 
Note then that 


\{B^Xk,B^^Xk) - {Nk,e,Nk,2i)\ < ||B^Xfc||IlS^'xfc - iVfc.2dl + ll^'xfc - NkA\\Nk,2iim 

However, from (82) and Lemma 39, we have w.h.p. ||i?^Xfe||||H^^Xfe — Nk, 2 e\\ = o{y,^n). Also, 
from Proposition 37(i), and Lemma 39, we find w.h.p. ||i?^Xfe — ||||A^fc, 2 £|| = So 

finally, w.h.p. 

\{B^Xk,B'^^Xk) - {Nk,i,Nk, 2 i) \ = o{nfn). 

On the other hand, by Proposition 37(v), {Nkj, Nk, 2 i) is w.h.p. larger than co/i^^n for some 
Co > 0. It concludes the proof (ii). 

Proof of (iv). Let fik = k'kX ^/a. From (2) and (82)-(83) for k,j G [r], w.h.p. 


\{‘fj,‘fk) \ = 


{B^Xj,B^Xk)\ ^ \{B^Xj,B^Xk)\ ^ \{B%,B^Xk) 


< 


< 


cf^not^ 


ll-S^Xallll^^Xfell C5n/rj^^ 

In addition. Equations (82)-(83), Proposition 37(i) and Lemma 39 entail that w.h.p. 

\{B^Xj,B^Xk) - {Nj,e,Nk,,)\ < | 1 H\,||- N^A + \\bAj - ^aullll^fc.^ll = o(a^^/\logn 
From Proposition 37(iii), we get ii k j that w.h.p. 

\{f>„(pk)\ = 0(a3U2„-7/2(l0g^)5/2y 

Proof of (v). Let k j G [ro]. From (2) and (82), w.h.p. 


I (Ci 1 Pk)\ — 


\{BAj,B^Ak)\ ^ \{BAj,B^Ak)\ 


\B^B*^xA\\BAk\\ 


CT.na'^ 
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As in (84), we use \ {x,y) — {x',y')\ < ||a;'||||y — j/'|| + || 2 /|| ||x —x'||. We find from (82), Proposition 
37(i) and Lemma 39 that w.h.p. 

\{B^Xj,B^^Xk) - {Nj,i,Nk,2i)\ = 0(a"'v^(logn)5/2). 

Also, from Proposition 37(iv), w.h.p. Nk, 2 e) is 0(n^“'''/^a^^/^(logn)®/^). We conclude 

finally that 

|(0,^fc)|=o(A-'^/^(logn)^/^). 

Proof of (vi). We again use the same argument. Let k ^ j G [ro]. From (82), 

\{B^B*^Xk,B^B*^Xk)\ ^\{B^B*^Xk,B^B*^Xk)\ 
\\B^B*‘xk\\\\B^B*%\\ - clna^^ 

Recall that \{x,y) — {x',y')\ < ||a;'|||| 2 / — y'\\ + ||2/||||a; — x'\\. Then from (82), Proposition 38(i) 
and Lemma 39, we obtain w.h.p. 


\{B^B*^Xk,B^B*^X3) - {Pk,e + Sk,i,N,^i + Sk,e)\ = Oiilogn)^a^^Vn). 

Finally, from Proposition 38(iii), {Pk,e + Sk,e, Nj^i + Sk,i) is 0(n^“'''/^Q;®^/^(log n)®/^). □ 


10 Norm of non-backtracking matrices 

In this section we prove Proposition 20. The argument used for Erdos-Renyi graphs extends 
rather directly to the stochastic block model. 


10.1 Decomposition of 

In this paragraph, we essentially repeat the argument of subsection 5.2. We define, for u ^ v G 
V, the centered variable, 

—UV ^UV ^^(t(u)(t(v) ■ 

We now re-define K as the weighted non-backtracking matrix on the complete graph on V, for 
ejGE{V), 

Eef l(e y /)lFcr(ej)cr(e2) I 

where e —>■ / represents the non-backtracking property, 62 = fi and e ^ f~^. We also introduce 

2 

where e —> / means that there is a non-backtracking path with one intermediate edge between 
e and /. We define A^^), as in subsection 5.2. is now defined as 

t-i e 

{Pt )ef = JT—7a7s+l^'^(7t)cr(7t+i) JJ^7s7a + l! 

t +1 


where the set of paths Fttf is still defined as in Section 5.2. We again use the decomposition 


e En 4 . 7 . 7 . 


V n 


7l^a7. + l 


t + 1 
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to obtain 


qW = +1 y j^i2)^ie-t-i) 1 _r: 

71 71 71 71 ' 




W 


t=o 


We introduce 


and L = X( 2 ) _ IV_ 


(85) 


It now follows from (85) that when G is ^-tangle-free, 


n 

^-1 




( 86 ) 


+ ii:iis<'>ii+iiA('-«ii+^j:iiif!'>ii. 

where we have again let s\p := as in Section 5.2. We will now upper bound 

the above expression over all x such that {xj,B^x) = 0 . 


-Ve 

71^ ^ 


10.2 Proof of Proposition 20 

The proof of Proposition 20 parallels that of Proposition 11. 

The main task is to adapt the arguments of Section 6 to bound the norms || A(‘^|j, || A^^^XfcH, 
||S^*^||, II it' II, ||5'(^^||. We only the two main differences. 

First, the expressions (39)-(46)-(49) now depend on the types of the vertices involved in 
a path. We treat for example the case of (39) needed for Proposition 30. We claim that if 
7 € 114 ,m is a canonical path with e edges and v vertices, 

:nn4.„,.-.,,.„.,.,<(^)"'‘(;)""‘, (87) 

r i—1 s=l 

where the sum is over all injections r : [fc] —)■ [n], a = max^j Wij and an = ca + 0{n~'^) is 
defined in (69). Indeed, we consider a spanning tree of 7 , for the e — u -|- 1 edges not present in 
the spanning tree, we use the bound, EA^^ < W„{^n)a(v)l'ki < a/n for any p > 1 and u,v G [n]. 
For the remaining u — 1 edges, we take a leaf, say I, of the spanning tree of 7 , and denote its 
unique neighbor by g. Then, the injection r : [fc] —>■ [n] will give a label say i = a{T{g)) to g 
and j = a{T{l)) to 1. We use the bound that for any p > 1 and i G [n], 'n{j)Wij/n < otn- 

Hence, summing over all possible values of r(/) while fixing T{q), g ^ I, gives a factor of at most 
anjn in (87). We then remove I from the spanning tree and repeat this procedure v — 1 times, 
this yields (87). 

With (87) in place of (39), using Lemma 17 we then bound S given by (38) as follows 


< 


/cm+1 km 


v—3 e—v—1 


/Q^n" 


V n y 

* Vn/ 


fcm+1 km 


< na^n^ 




e—D+l 


10m(e—v+l)+8m^e——e—1 


n 


v—3 6=7! —1 

< na’^^{2emf^{em) 


/ {2£am) 


10m 


s=0 ^ 
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In the range of fc < £ and m defined by (35), we have + o(l)). We deduce that 

the bound (30) on || A|| continues to hold for the stochastic block model. Similarly, by the same 
adaptation of (46)-(49), we find that bounds (32) and (33) on and continue to 

hold for the stochastic block model. 

The second difference lies in the definition of the matrix L = — W. For the stochastic 

block model, from (10), the entry Lgf is zero unless e = f, e ^ f, f~^ —> e or e —> f~^. 
Moreover, the non-zero entries are bounded by a. Then the argument of the proof of bound 
(34) carries over easily. 

With bounds (30)-(32)-(33)-(34) available for the stochastic block model, the remainder of 
the proof of Proposition 20 repeats the argument of subsection 5.4. 


11 Stochastic Block Model : proof of Theorem 5 


The strategy of proof is based on the following lemma which asserts that the existence of a 
Boolean function non constant over the classes ensures the existence of an estimation with 
asymptotically positive overlap. 

Lemma 40. Assume that 7r(i) = Ijr and there exists a function F : V ^ {0, 1} of the graph G 
such that in probability, for any i G [r], 


lim - ^ l{^(^)=i}F(u) = —, 

n.—^nr, n ‘ ^ i \ / J j. 


V — 1 


where / : [r] —)■ [0,1] is not a constant function (there exists (i,j) such that f{i) ^ fij))- Let 
(/■'■, I~) be a partition of [r], such that 0 < |/+| < r and 


1 

W\ 


n /(*) > 

i^I+ 


1 


i^I- 


( 88 ) 


Then the following estimation procedure yields asymptotically positive overlap with permutation 
p in (16) egual to the identity : assign to each vertex v a label o'(v) picked uniformly at random 
from if F{v) = 1 and from I~ if F{v) = 0. 

Observe that the existence of a non trivial partition (/+,/“) satisfying (88) is implied by 
the assumption that / is not constant. 


Proof of Lemma 40. Let j G and v G V such that a{v) = j, then, given the realization 
of the graph, the event a{v) = cr(v) is equal to F'(w)e„, where £y is an independent Bernoulli 
{0, l}-random variable with P(e„ = 1) = l/jl+j. From the law of large numbers, we deduce 
that, in probability. 


1 

n 


n 

^cr(v)=j^a(v)=<j{v) 

V — 1 


f{i) 

r|/+| 


Summing over all j G , in probability. 


1 

n 


n 

^ ^ ^a{v)^I'^^a{v)—(7{v) 
v—1 


l± 

r 
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where /+ is the left hand side of (88). Similarly, if /_ is the right hand side of (88), in probability. 


1 

n 


n 


1 -/- 


Finally, in probability 

- ^ h(v)=c,{v) - - ^ -(/+ + 1 - - 1 ) = -(/+ - /-) > 0 , 

V — 1 


where the strict inequality comes from (88). 


□ 


Our aim is now to find a non constant functions over the classes which depends on the 
eigenvector To this end, we introduce a new random variable, for v € V, 


hA'^) = X! PkAA, 

e^E-.e^—v 

where Pk^i was defined by (76). Our first lemma is an extension of Proposition 38. 

Lemma 41. Let i ^ /tlog^n with 0 < k < yA 1/2, k S [ro] and i G [r]. There exists a random 
variable Yk^i such that = 0, ElYfe^/ < oo and for any continuity point t of the distribution 
of \Yk,i\, in Lf, 

1 ” 

V — 1 

Proof. Let Zt, t > 0, he the Galton-Watson branching process defined in Section 8 started from 
Zq = (5t and l has distribution (7r(l),..., 7r(r)). We denote by (T, o) the associated random 
rooted tree. Let D be the number of offspring of the root and for 1 < a; < O, let QkAA be the 
random variable Qk/ defined on the tree where the subtree attached to x is removed and set 


D 

x—1 


We observe that 

OO OO _Qj 

EJfey = — nE[Qk,t\D = n - 1] = V' ^ ^ E[Qk/\D = n-l] = aEQk,e.. 

n\ [n — 1 ! 

n—O n=l ^ ^ 

Also, by Theorem 25, the variable Qk,efJ-A^ — / {AkA ~ 1) converges in to a centered 

variable Xk satisfying EX^ < C. However, the variables Qk/ and ■Jk,i are closely related, 
indeed, 

i 

Jk,i = {D- 2)Ll, + {D-l)y2y2 ^k/ = {D- l)Qk,e - 

t—1u^Y° 

where was defined above (61). The inequality (64) for t = 0, shows that E\L°f. A = 
0{a^A- Hence, L% AA converges in Lf to 0. From Theorem 25, we find that Jk,efJ-A^ — 
Oip-kf’kiA/[Ak/^ ~ 1) converges weakly to a centered variable Yk satisfying ElYfej < C. In 
particular, if t is a continuity point of \Yk\, iAA) = AAl^k/t^A^ ~ o^Tk(l>k{i)/(Al/^ ~ 1)1 > i) 
converges weakly to 1(ct(o) = j)l(|Yfc| >t). It then remains to apply Proposition 36. □ 
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Our second lemma checks that we may replace Pk/ in the above statement by the eigenvector 
properly renormalized to be asymptotically close to (15). More precisely, we set 

h{v) = ^ sy/ni'k{e), 


where s = \/a^ and p'j, was defined in Proposition 38. 

Lemma 42. Let k € [ro], i € [r] and Vk^i be as in Lemma 41- For any eontinuity point t of the 
distribution of\Yk^i\, in LP', 

1 "■ 

7r(z)P(|yij^j I > f). 

v—1 

Proof. From Proposition 38(i) we find that, in probability. 


1 Pk,i 

n ^ 

e&E 


k,l ^ ^2^ 


We set ^fe(e) = Pk^t/{pkYSy/n). From Lemma 39, we have w.h.p. 


fk - Ck 


= 0(aVfc^^)=o(l). 


Also, from Theorem 4, w.h.p. 


iia-efcii = o(i). 

Hence, from the triangle inequality, w.h.p. 

ik-f.k =o(l)- 

We deduce from Cauchy-Schwartz inequality that w.h.p. 


1 


E 


. . , hA'i’) 

lk\0) 21 

n 


< 


Y.\^'k{e)-Ue) 


< 


m 


ik-A =o(i)- 


eeE 


Since t is a continuity point of |F/c,i|, it is then a routine to deduce Lemma 42 from Lemma 
41. □ 


All ingredients are now gathered to prove Theorem 5. We fix fc G [rg] as in Theorem 5 and 
let f'f. be as above. We set 

:= {i G [r] : 4>k{i) > 0} and J~ = [r] \ J^. 

From Lemma 42, there exist random variables Xj, j G [r] on M such that EXj = apLkfikij) / 

1) and the following holds for all j G [r]. With as above, for all t G K that is a continuity 
point of the distribution of Xj, the following convergence in probability holds: 

1 ” 

n E ^-yiv)=jhkiv)>t = > t). (89) 
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Write, for e = ±, tTe = 9e = Y.jeJ^ '^{j)4’k{j)- Note that g+ > 0 by definition of 

J+. Also by the orthogonality relation (9) between (pi = 1 and pk we obtain = 0, so 

that g- < 0. For e = ±, we shall denote by the random variable obtained as a mixture of 
the Xj for j G with weights Tr{j)/TT^. Note that Xg has mean ^g/TTg. 

We now establish the existence of to € M that is a continuity point of the distribution of 
both X_|_ and X_, and such that 

P(X+ > to) > P(X_ > to). (90) 


To this end, since EX+ > 0, we write 


^-|-oo p-\-oo /•-poo 

/ P(-^+ > t)dt > / P(—X+ > t)dt = / P(—A+ > t)dt. 

Jo Jo Jo 

The same argument yields 

(•+00 p+oo 

/ P(X_ > t)dt < / P(-X_ > t)dt. 

Jo Jo 

Combined, these two inequalities imply 



{[P(X+ >t)- P(X_ > t)] + [P(X+ > -t) - P(X_ > -t)]} dt > 0. 


Thus there is a subset of M+ of positive Lebesgue measure on which either P(X_|_ > t) > P(X_ > 
t) or P(X_|_ > —t) > P(X_ > —t). This implies the existence of a continuity point tg G ® of 
both X_|_, X_, —X+ and —X_ such that (90) holds. 

We may now come back to the eigenvector in Theorem 5. We set t = sto in Theorem 5. 
For some unknown sign uj G { — 1,1} we have 




(91) 


Case 1; the sign can be estimated. We first assume that w is known and = ^k- 
We consider the function 


F{v) - - l{A(ri>to}- 

From (90), (88) is satisfied with and we can apply Lemma 40 to obtain an asymptot¬ 

ically positive overlap. We note that the sign ui is easy to estimate consistently if the random 
variable X which is the mixture of the Xj with weights 7r(j) = 1/r is not symmetric. Indeed, 
in this case, for some bounded continuous function /, 

r r 

E/(X) = ^ 7r(j)E/(X,) + Y. = E/(-N). 

i=i i=i 

Then, from (89), given w, in probability, 

n 

lim - V f{ujlk{v)) = Ef{ujX) 

n—>-oo 77, 

v —1 

takes a different value for w = 1 and w = — 1. 
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Case 2: fully symmetric case. Another simple case is if X defined above is symmetric 
and |J+| = |J“|. If this occurs, A+ and — A_ have the same distribution. We consider the 
function F{v) = iiJ2e-e2=v ^kie) ^ > to) and the estimation where a vertex 

such that F{v) = 1 receives a uniform label in J^, and otherwise a label uniform in J~. 
By Lemma 40 applied to if w = 1, we obtain an positive overlap with and the 

permutation p in (16) equal to the identity. If w = —1, we obtain an positive overlap with 
and any permutation p in (16) such that p{J^) = 

Case 3: general case. In the general case, we may use the same idea : apply Lemma 
40 for a partition (/+,/“) which may depend on to but such that the cardinal of does not. 
First, from (89), the function fi{j) = V{Xj > to) is not constant on [r] and there exists ji such 
that 

We distinguish two subcases. The first case is when the function f-i{j) = V{—Xj > to) is also 
non-constant. Then there exists j_i such that 

f-iU-i) > ^-j- /-lO’)- 

j¥=3-l 

We consider the function F{v) = iiJ2e-e2=v ^k{e) > Tj^/n) = l(a;//j(u) > to) and the estimation 
where d'(v) = 1 if F(v) = 1 and d'(v) uniform on {2,... ,r} otherwise. We apply Lemma 40 to 
F and the partition J+ = {jui}, I~ = We obtain an asymptotically positive overlap 

for any permutation p in (16) such that p{jui) = 1- 

In the other case, /_i is constant and equal to say a. We introduce extra random independent 
variables u}'{v) G {—1,1} iid such that P(w'(u) = 1) = P(w'(u) = —1) = 1/2. We consider the 
function F{v) = l(w'(u) Cfc(e) > r/^n)- Then, by (89), in probability, 

^ > to) + «)• 

71 rOO iIj ^ 

V — 1 

Hence, it follows from (90) that (88) is satisfied with . We can then apply Lemma 40 

to obtain an asymptotically positive overlap. 

It concludes the proof of Theorem 5. 
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